
USPTO 



STIC BC 2100 -■(oviad 
Search Request Form ( Ul 



Today's Date: 



What date would you like to use to limit the search? 

Priority Date: 7 /v /<=>) Other: 



Name 



AU Ml 2. Examiner # Vl r <f) 

Room# 4/?>£ Phonel^£i?ll^. 
Serial* O^/m.m 



Format for Search Results (Circle One): 

;£app*. disk email 

Where have you searched so far? 

<<f§B DWPI EPO JPO ACM IBMTDB 
IEEE INSPEC SPI Other. 



Is this a "Fast & Focused" Search Request? (Circle One) NO 

A "Fast & Focused" Search is completed in 2-3 hours (maximum). The search must be on a very specific topic and 
meet certain criteria. The criteria are posted in EIC2100 and on the EIC2100 NPL Web Page at 
http://ptoweb/patents/stic/stic-tc2100.htm. 



What is the topic, novelty, motivation, utility, or other specific details defining the desired focus of this search? Please 
include the concepts, synonyms, keywords, acronyms/definitions, strategies, and anything else that helps to describe 
the topic. Please attach a copy of the abstract, background, brief summary, pertinent claims and any citations of 
relevant art you have found. 



STIC Searcher (3eo f ^CT-tu "SrT-Leore<T 



Date picked up fS 



_ Phone cSOg- 
Date Completed_ 



7SOO 




Rcnourctu: Adro5nw*rut«n 



USPTO 



STIC Database Tracking mSSm 



TO: Chongshan Chen 

Location: 

Art Unit : 2172 

Friday, August 08, 2003 

Case Serial Number: 09/897803 



Search Notes 



From: Geoffrey St. Leger 
Location: EIC 2100 
PK2-4B30 
Phone: 308-7800 

geoffrey.stleger@uspto.gov 



m- ^ ^ 



Dear Examiner Chen, 

Attached please find the results of your Fast & Focused search request for application 09/897803. 1 searched 
Dialog's foreign patent files, technical databases, product announcement files and general files. 

Please let me know if you have any questions. 

Regards, 




G<£oJtrey 
4B30/308-7800 




- Soarcfi aivi tniormtticn 
ReoourcM Administration 



File 8:Ei Compendex(R) 1970^-2003/ Jul W4 

(c) 2003 Elsevier Eng. Info. Inc. 
File 35: Dissertation Abs Online 18 61-2003/ Jul 

(c) 2003 ProQuest Inf o&Learning 
File 202:Info. Sci . & Tech. Abs. 1 966-2003/ Jul 31 

(c) Information Today, Inc 
File 65: Inside Conferences 1993-2003/Aug Wl 

(c) 2003 BLDSC all rts . reserv. 
File 2:INSPEC 1 969-2003/ Jul W4 

(c) 2003 Institution of Electrical Engineers 
File 233: Internet & Personal Comp. Abs. 1981-2003/ Jul 

(c) 2003 Info. Today Inc. 
File 94 : JICST-EPlus 1 985-2003/ Jul W4 

(c)2003 Japan Science and Tech Corp(JST) 
File 603:Newspaper Abstracts 1984-1988 

(c)2001 ProQuest Inf o&Learning 
File 483: Newspaper Abs Daily 1 98 6-2003/Aug 07 

(c) 2003 ProQuest Inf o&Learning 
File 6:NTIS 1964 -2003/Aug W2 

(c) 2003 NTIS, Intl Cpyrght All Rights Res 
File 144:Pascal 1 97 3-2003/ Jul W4 

(c) 2003 INIST/CNRS 
File 434 : SciSearch ( R) Cited Ref Sci 1974-1989/Dec 

(c) 1998 Inst for Sci Info 
File 34 : SciSearch { R) Cited Ref Sci 1 990-2003/Aug Wl 

(c) 2003 Inst for Sci Info 
File 99:Wilson Appl . Sci & Tech Abs 1 983-2003/ Jun 

(c) 2003 The HW Wilson Co. 
File 583:Gale Group Globalbase (TM) 1 98 6-2002/Dec 13 

(c) 2002 The Gale Group 
File 266:FEDRIP 2003/Jun 

Comp & dist by NTIS, Intl Copyright All Rights Res 
File 95 :TEME-Technology & Management 1989-2003/ Jul W3 

(c) 2003 FIZ TECHNIK 
File 438: Library Lit. & Info. Science 1 984 -2003/ Jun 

(c) 2003 The HW Wilson Co 



Set 


Items 




Description 


SI 


737349 




DATABASE? ? OR DATA () BASE? ? OR REPOSITOR??? OR DBM OR DBMS 






OR RDBM OR RDBMS 


S2 


5110 




S1(5N) (DUPLICAT? OR REPLICA? OR COPY??? OR COPIE? ? OR REP- 




RODUC?) 


S3 


33732 




PARTITION? { 5N) { DETERMIN? OR ESTIMAT??? OR ANALYZ? OR ANALY- 




S? 


OR ASSESS? OR CALCULAT? OR ASCERTAIN? OR COMPUTE OR COMPUT- 




ES 


OR COMPUTED OR COMPUTING OR GAUG? OR EVALUAT? OR FIGURED OR 






FIGURING OR MEASUR? OR DEFIN?) 


S4 


7339 




PARTITION? {5N) {SIZE? ? OR SIZING OR BOUNDAR??? OR RANGE? ? 




OR 


EXTENT? ? OR MAGNITUDE? ?) 


S5 


2532113 




SAMPL??? 


S6 


1509046 




STATISTIC?? 


S7 


0 




S2 AND S3 AND S5 AND S6 


S8 


0 




S2 AND S4 AND S5 AND S6 


S9 


24 




SI AND S3:S4 AND S5 AND S6 


S10 


23 




RD (unique items) 


Sll 


19 




S10 NOT PY=2002:2003 


S12 


184 




SI AND S3:S4 AND S5:S6 


S13 


31 




S2 AND S3 


S14 


5 




S2 AND S4 


S15 


32 




S13:S14 


S16 


25 




RD {unique items) 


S17 


44 




S4 (10NJS6 


S18 


926 




S4(5N) (DETERMIN? OR ESTIMAT??? OR ANALYZ? OR ANALYS? OR AS- 



SESS? OR CALCULAT? OR ASCERTAIN? OR COMPUTE OR COMPUTES OR CO- 
MPUTED OR COMPUTING OR GAUG? OR EVALUAT? OR FIGURED OR FIGURI- 
NG OR MEASUR? OR DEFIN?) 



519 149 S5:S6 AND S18 

520 3 SI AND S19 

521 3 RD {unique items) 



522 10 S18(15N)S6 

523 9 RD (unique items) 

524 12 S21 OR S23 

525 26 S18 AND SI 

526 22 RD (unique items) 

527 22 S21 OR S26 

528 23 S18(15N)S5 

529 18 RD (unique items) 



11/5/2 (Item 2 from file: 35) 

DIALOG (R) File 35 : Dissertation Abs Online 
(c) 2003 ProQuest Inf o&Learning . All rts. reserv. 

01808815 ORDER NO: AADAA-I 9938 128 

Novel computational methods for drug design and discovery: Recursive 
partitioning analysis of pharmaceutical database , automated 
pharmacophore identification, and fast free-energy calculations 

Author: Chen, Xin 
Degree: Ph.D. 
Year: 1999 

Corporate Source/Institution: The University of North Carolina at Chapel 

Hill (0153) 
Adviser: Alexander Tropsha 
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This dissertation is composed of three parts. Each of them describes a 
new computational method specifically developed for assisting the rational 
drug design and discovery, either ligand-based or receptor-based. 

Recursive partitioning is a powerful data mining technique and has 
been successfully applied to large chemical data sets like HTS data sets, 
however, the previous work was limited to 2D descriptors, while medicinal 
chemists believe that drug molecules exert their pharmaceutical functions 
in the three dimensions. So, reported here is my work extending the former 
recursive partitioning analysis 1 8 to the three dimensions, using 3D 
Sldquo; atom” pairs as molecular descriptors. Correct 3D 
structure-activity relationships were successfully derived from a data set 
Containing 1,644 monoamine oxidase inhibitors. 

Based on the successful 3D recursive partitioning work, a novel 
computational program, SCAMPI ( Statistical Classification of Activities 
of Molecules for Pharmacophore Identification) , is developed for 
identifying pharmacophores from large chemical data sets. SCAMPI combines 
recursive partitioning and fast conformational search methods and make them 
dependent on each other in the pharmacophore identification process, by 
following the adaptive sampling strategy. The recursive partitioning 
algorithm implemented in SCAMPI belongs to the class of CHAID algorithms. 
The conformational search algorithm in SCAMPI is developed based on the 
“ differential distance equation” algorithm. Presently, SCAMPI 
is able to derive pharmacophores from 1 , OOO– 2, 000 compounds within 
one day of computation on a SGI R10000 machine. 

For receptor-based drug design, a generalized linear response method 
is developed for facilitating the hydration and binding free energy 
calculations. This new method is based on the standard linear response 
approximation and extends it to the van der Waals contribution term. 
Compared with other linear response methods for free energy calculations, 
this method does not contain any new empirical parameters. This method has 
been tested for hydration and binding free energy calculations and 
demonstrated to provide the calculated results consistent with the 
experimental data in the both cases . 
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This research study presents the mathematical basis for building the 
MC-HARP data-processing environment. The MC-HARP strategy determines the 
functional structure and parameters of a mathematical model simultaneously. 
A Monte Carlo (MC) strategy combined with the concept of Hierarchical 
Adaptive Random Partitioning (HARP) and fuzzy subdomains determines the 
multivariate parallel distributed mappings. The constructed mapping can be 
modeled as a neural network. The HARP algorithm is based on a 
divide-and-conquer strategy that partitions the input space into 
measurable connected subdomains and builds a local approximation for the 
mapping task. Fuzziness promotes continuity of the mapping constructed by 
HARP and smooths the mismatching of the local approximations in the 
neighboring subdomains. The Monte Carlo superposition of a sample of 
random partitions, reduces the localized disturbances among the fuzzy 
subdomains, controls the global smoothness of the mean average mapping, and 
improves the generalization of the constructed mapping. 

The tree structure of the HARP modules and the independence of both 
the subdomain approximations and the random partitions enable the MC-HARP 
environment to quickly converge to a series of equally plausible solutions 
without user interaction. The MC-HARP environment enjoys a large-scale 
granularity produced by the Monte Carlo parallelism and the geometric 
parallelism achieved by partitioning the input space. Therefore this 
environment can exhibit good performance on parallel computers for large 
and complex scientific databases . 

The developed MC-HARP philosophy for building data - based 
approximate mappings leads to a novel model selection criterion and an 
original framework for classifying data-fitting problems. The MC-HARP 
environment not only can build approximate multivariate mappings with 
self-organization capability, noise and fault tolerance, adaptivity, 
generalization, highly plastic and stable learning characteristics with 
respect to the addition of new data points, and parallel structure but also 
can answer fundamental questions in data - based mathematical modeling. 
These questions include: (1) What is the confidence level for each 
predicted output of the constructed model? (2) What is the approximation 
confidence measure for the constructed model? (3) How does the functional 
complexity of the actual multivariate mapping change over the input space? 
(4) What is the suitable structural complexity for a data - based model 
using noisy data? (5) What is the level of noise in the data? (6) Is the 
amount of training data adequate? If not, which regions of the input space 
need more data? (7) Is the selected parametric model suitable? (8) What is 
the conditioning of a data-fitting problem? (9) Is data - based 
mathematical modeling promising for the given task? 

The developed MC-HARP environment can support the diverse needs of the 
scientific and engineering community. It has the versatility to develop and 
verify parametric and nonparametric mathematical models and also global and 
local approximate mappings. Furthermore, It establishes an environment for 
unifying existing mathematical modeling techniques in statistics , 
approximation theory, information theory, system identification, and neural 
networks. 
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Statistical pattern recognition techniques may be applied to 
cardiograms for automated diagnosis. The three vectorcardiograph^ signals 
in the Frank-orthogonal-lead system are expressed as truncated 
Karhunen-Loeve expansion in terms of a set of time-varying orthogonal basis 
vectors. These vectors are derived from the second-order statistics of 
the data. In addition to the basic formulation of the algorithm, an elegant 
proof of its minimizing property is presented. 

An ensemble of 670 cardiograms is being used to train the 
algorithm, and the resulting pattern vectors are clustered in a 
multidimensional features space. Baseline restoration is first performed on 
the data using a true third-order spline technique for best Y ( t ) -estimate 
of the baseline. Ordinates, Y, in the estimates are directly deducted from 
the P-Q interval of the waveform. The cardiogram is an ensemble of 
quasi-stationary processes; this is due to variations in both R-R and P-R 
intervals. As a feature extractor, the K-L expansion is optimal compared 
with Fourier-of performed on non-stationary processes. To achieve 
optimality, the heart is segmented (time partitioned) into two processes, 
namely the P-wave and QRST segment, and each segment is aligned on its 
fiducial point. The R-wave fiducial point is detected by searching the 
magnitude of the vector velocity for maxima. The P-wave fiducials are 
located via a new multi template correlation algorithm. 

Two separate K-L expansions are performed on each process. An 
ensemble-global K-L expansion is performed on the P-processes, to compute 
the P-basis vectors. Further, the ensemble is partitioned 

(ensemble-partitioning) into three partitions: (1) Gross-abnormal: This is 
the partition of gross depolarization abnormals in the QRS Complex (LBBB, 
RBBB, etc). (2) All-But-Gross: This is the partition of all other 
abnormalities. (3) QRS Suppressed: This is the partition of certain 
repolarization abnormals and normals (ST, T, etc) . A partition-global K-L 
expansion is then performed on the QRST process in each partition, yielding 
a set of K-L vectors for each. The underlying concept here is, since the 
class distribution probabilities are not known apriori, a much more 
efficient feature extractor would result if the ensemble is partitioned. 

In addition, since the K-L expansion is optimal using the least 
mean square error criteria, and since the PQRST is heavily weighted by the 
QRS complex (energy wise) , and to improve on classification accuracies in 
the post QRS segment, the waveform is time weighted {or QRS-Suppressed) 
before computing the partition base functions. This is accomplished by 
multiplying each time-varying sample of the partition by a weighting 
function. By suppressing the QRS, the basis vectors best represent 
repolarization classes. 

The approach to the pattern recognition problem is hierarchical: 
(1) Find a first-cut classification of pattern vectors-using 
ensemble-global K-L expansion. (2) Pursue a much more accurate 
diagnosis/classification using partition-global K-L expansion. 

To completely formulate the classification problem, the structure 
of the feature space is studied, using a fuzzy clustering algorithm with 
supervised seeding and class-dependent fuzziness. The underlying concept 
here is, since classes in the feature space are overlapping to various 
degrees, parametri zat ion is best estimated using the fuzzy approach. This 
is an extremely innovative concept (compared with hard-clustering) in 
handling the following two problems: (1) It allows for slight 
misclassif ication errors on part of the Cardiologist (is the diagnosis 100% 
certain abnormality or is it 95% and 5% others?) (2) It gives quantitative 
measure of probabilities of each of a disease state in multiple-diagnosis 
vectorcardiograms. Probability measures are proportional to some membership 
function measures. 

The clusterer described above is performed on a subset of the 
data - base that includes relatively nonempty sets of pure classes and one 
multiple diagnosis class. Members of the multiple diagnosis class are found 
to be best characterized as being cases with multiple membership functions 
to the adjacent pure classes, rather than being a class of their own. 
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A method of sorting large textual data - bases by computer using 
external storage is proposed. The range of sort-keys in a sample of data 
to be sorted is divided into a fix set of partitions, which should also 
give an adequate representation of new data from a similar source. The 
partitions are composed of ordered key ranges . An incoming data stream 
is distributed into a series of bins according to the partition in which 
the key lies, and the bins are then separately sorted, using an internal 
sort, to give an ordered file. It is shown how the number of disc accesses 
needed depends on the manner in which the bins become filled, and thus on 
statistics of the data. Experiments using an inspec data - base give 
information on which estimates of the efficiency of the method can be based 

Classification Codes and Description: 5.01 (File Design, Building, and 
Updating) 

Main Heading: Information Processing and Control 
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Abstract: Discusses the evaluation of a clustering solution. Criteria 
based on the number of clusters and discrimination and classification 
processes are used to evaluate the clustering solution. The proposed 
approach is based on two paradigms: statistics and machine learning. A 
multi-methodological approach is advocated in the construction of models 
associating properties with clusters, to provide a wider and richer set of 
analysis perspectives and better knowledge discovery. Specifically, the 
construction of classification and discrimination logical models as a 
complement of quantitative statistical models is particularly useful when 
most of the available information is of a qualitative nature. Both the 
classification's global precision and the comprehension added by the 
discriminant model to the association between variables and clusters are 
essential to evaluate a clustering solution. Depending on the dimension of 



the sample , the descriptive analysis performed can be validated by 
partitioning the total sample into two or by other procedures of 
cross-validation. The proposed evaluation approach is applied to a 
marketing/tourism case study. The clustering solution is built upon a 
sample of more than 2,500 Portuguese clients of Pousadas Portugal Hotels. 
The database includes variables related to the evaluation of client stays 
at the Pousadas and profiles of the surveyed clients on holidays, 
demographic and psychographic aspects. Measures of association, chi /sup 2/ 
tests, ANOVA, discriminant analysis, logistic regression and rule induction 
are applied in evaluating the clustering solution built through a K-means 
process. (14 Refs} 
Subfile: C 

Descriptors: data mining; hotel industry; learning (artificial 
intelligence) ; marketing; pattern classification; pattern clustering; 
statistical analysis 
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Abstract: Establishes the mathematical basis for building the MC-HARP 
data-processing environment. The MC-HARP strategy determines the functional 
structure and parameters of a mathematical model simultaneously. A Monte 
Carlo (MC) strategy combined with the concept of Hierarchical Adaptive 
Random Partitioning (HARP) and fuzzy subdomains determines the 
multivariate parallel distributed mapping. The HARP algorithm is based on a 
divide-and-conquer strategy that partitions the input space into 
measurable connected subdomains and builds a local approximation for the 
mapping task. Fuzziness promotes continuity of the mapping constructed by 
HARP and smooths the mismatching of the local approximations in the 
neighboring subdomains. The Monte Carlo superposition of a sample of 
random partitions reduces the localized disturbances among the fuzzy 
subdomains, controls the global smoothness of the mean average mapping, and 
improves the generalization of the approximation. The authors illustrate 
the procedure by applying it to a two-dimensional surface fitting problem. 
(23 Refs) 
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Statistical analysis of large data sets often requires an initial data 
editing and preparation phase to check the validity of individual data 
items, check for consistency among related data, correct erroneous data, 
and supply (impute) values for missing data where possible. During this 
preparatory phase of analysis , it is often necessary to partition the 
data set into a number of subsets by logical selection and/or random- 
sampling techniques for purposes of hypothesis testing. This paper 
examines the data-management support required by these editing and 
subsetting operations in terms of lower-level data-manipulation functions 
and mappings between logical and physical data structures. Advantages of 
transposed data files for statistical applications are discussed in 
comparison with record-based structures. A specific self -describing 
transposed-f ile design is described in detail, with emphasis on 
representations of logical data structures commonly encountered in 
statistical databases . (ERA citation 08:043115) 

Descriptors: Statistics ; Data Processing; Validation; Corrections; 
Mapping 

Identifiers: ERDA/990200; NTISDE 

Section Headings: 62B (Computers, Control, and Information 
Theory—Computer Software) 
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The goal of record linkage is to link quickly and accurately records that 
correspond to the same person or entity. Whereas certain patterns of 
agreements and disagreements on variables are more likely among records 
pertaining to a single person than among records for different people, the 
observed patterns for pairs of records can be viewed as arising from a 
mixture of matches and nonmatches. Mixture model estimates can be used to 
partition record pairs into two or more groups that can be labeled as 
probable matches (links) and probable nonmatches (nonlinks) . A method is 
proposed and illustrated that uses marginal information in the database 
to select mixture models, identifies sets of records for clerks to review 
based on the models and marginal information, incorporates clerically 
reviewed data, as they become available, into estimates of model 
parameters, and classifies pairs as links, nonlinks, or in need of further 
clerical review. The procedure is illustrated with five datasets from the 
U.S. Bureau of the Census. It appears to be robust to variations in 
record-linkage sites. The clerical review corrects classifications of some 
pairs directly and leads to changes in classification of others through 
reestimation of mixture models. 

English Descriptors: Statistical estimation; Linear estimation; 

Statistical regression; Paired comparison; Administrative document; 
Census; EM algorithm; Mixture; Modeling; Iterative method; Maximum 
likelihood; Sample survey; Likelihood function; Statistical theory; 
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Author: Chinchwadkar , Gajanan S . ; Goh, Angela; Lim, Ee-Peng 
Corporate Source: Nanyang Technological Univ, Singapore, Singapore 
Conference Title: Proceedings of the 1997 1st International Conference on 

Information, Communications and Signal Processing, ICICS. Part 2 (of 3) 
Conference Location: Singapore, Singapore Conference Date: 

19970909-19970912 
Sponsor: IEEE 

E.I. Conference No.: 48010 

Source: Trends in Information Systems Engineering and Wireless Multimedia 
Communications Proceedings of the International Conference on Information, 
Communications and Signal Processing, ICICS v 2 1997. IEEE, Piscataway, NJ, 
USA. p 800-804 

Publication Year: 1997 

CODEN: 002795 

Language: English 

Document Type: CA; (Conference Article) Treatment: T; (Theoretical) 
Journal Announcement: 9804W4 

Abstract: Vertical Partitioning of Object Oriented Databases (OODBs) is 
a difficult problem. In the present paper, we present simulated annealing 
(SA) approach for generating partitions which are suitable for asynchronous 
parallel processing of queries . We study two cost functions for SA and 
compare the resulted partitions with respect to irrelevant 10, % 
distribution of 10 load for transactions across the processing nodes and 
the standard deviation of the partition sizes which determines the 
load balance in the asynchronous parallel query processing. The results are 
compared with one of the existing vertical partitioning algorithms. (Author 
abstract) 9 Refs. 

Descriptors: Relational database systems; Object oriented programming; 
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Algorithms 
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Conference Title: Proceedings of the Eleventh International Conference on 
Information and Knowledge Management. CIKM 2002 p. 60-7 
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Publisher: ACM, New York, NY, USA 
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/Abstract: In decision support systems, having knowledge on the top k 



values is more informative and crucial than the maximum value. 
Unfortunately, the naive method involves high computational cost and the 
existing methods for range-max querying are inefficient if applied 
directly. We propose a pre-computed partition top method (PPT) to partition 
the data cube and pre-store a number of top values for improving query 
performance. The main focus of this study is to find the optimum values for 
two parameters, i.e., the partition factor (b) and the number of pre-stored 
values (r) , through an analytical approach. A cost function based on 
Poisson distribution is used for the analysis. The analytical results 
obtained are verified against simulation results. It is shown that the PPT 
method outperforms other alternative methods significantly when proper b 
and r values are used. (14 Refs) 
Subfile: C 
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Conference Title: Database and Expert Systems Applications. 11th 
International Conference, DEXA 2000. Proceedings 

Conference Date: 4-8 Sept. 2000 Conference Location: London, UK 
Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: Histograms are used in most commercial database systems to 
estimate query result sizes and evaluation plan costs. They can also be 
used to optimize join algorithms. The authors consider how to use 
histograms to improve the join processing in temporal databases . We 
define histograms for temporal data and a temporal join algorithm that 
makes use of this histogram information. The join algorithm is a temporal 
partition- join with dynamic buffer allocation. Histogram information is 
used to determine partition boundaries that maximize overall buffer 
usage. We compare the performance of this join algorithm to temporal join 
evaluation strategies that do not use histograms, such as a partition-based 
algorithm based on sampling and a part ition- join using the Time Index, an 
index structure for temporal data. The results demonstrate that the 
temporal partition- j oin is substantially improved through the incorporation 
of histogram information, showing significantly better performance than the 
sampling based algorithm and achieving equivalent performance to the Time 
Index join without requiring an index. (12 Refs) 
Subfile: C 
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Conference Title: Proceedings of 8th International Conference on Database 
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Conference Sponsor: Eur. Union; Eur. Res. Consortium for Inf. & Math 
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Abstract: This paper presents a general methodology for the efficient 
parallelization of existing data cube construction algorithms. We describe 
two different partitioning strategies, one for top-down and one for 
bottom-up cube algorithms. Both partitioning strategies assign subcubes to 
individual processors in such a way that the loads assigned to the 
processors are balanced. Our methods reduce inter-processor communication 
overhead by partitioning the load in advance; they enable code reuse by 
permitting the use of existing sequential data cube algorithms for the 
subcube computations on each processor. This supports the transfer of 
optimized sequential data cube code to a parallel setting. The bottom-up 
partitioning strategy balances the number of single attribute external 
memory sorts made by each processor. The top-down strategy partitions a 
weighted tree in which weights reflect algorithm specific cost measures 
like estimated group-by sizes . Both partitioning approaches can be 
implemented on any shared disk type parallel machine. Experimental results 
presented show that our partitioning strategies generate a close to optimal 
load balance between processors. (27 Refs) 
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Language: Korean Document Type: Journal Paper (JP) 
Treatment: Practical (P) 

Abstract: Presents a functional testing tool for the BADA III DBMS , 
which is an object-oriented database system under development in the 
Electronics and Telecommunications Research Institute. The system 
architecture and characteristics of the testing tool, test databases , 
design principles for test cases and implementation issues are described in 
detail. The schema of the test databases is constructed to be suitable 
for the object-oriented paradigm, and the instances are synthesized to help 
the user to understand easily. The test tool features test independence and 
self-evaluation, and has been developed to verify all the functionalities 
of BADA-III/C++ . Each test case has been derived under eight design 
principles that are essentially based on various black-box techniques, such 
as equivalent partitioning , boundary -value analysis and error 
guessing. The testing tool offers 966 test cases in total, in 167 test 
programs. (12 Refs) 

Subfile: C 

Descriptors: object-oriented databases ; program testing; software tools 

Identifiers: functional test suite development; BADA-III/C++; functional 
testing tool; object-oriented database system; system architecture; test 
databases ; test case design principles; implementation issues; database 
schema; instance synthesis; test independence; self -evaluation; black-box 
techniques; equivalent partitioning; boundary- value analysis; error 
guessing; test programs 

Class Codes: C6150G (Diagnostic, testing, debugging and evaluating 
systems); C6160J (Object-oriented databases) 

Copyright 1998, IEE 



27/5/16 (Item 2 from file: 6) 

DIALOG (R) File 6:NTIS 

(c) 2003 NTIS, Intl Cpyrght All Rights Res. All rts. reserv. 

1602992 NTIS Accession Number: AD-A239 326/2 

Object Recognition in Range Images Using CAD Databases 
(Final rept . 1 Feb 89-31 Jul 90) 
Jain, R. 

Michigan Univ., Ann Arbor. Artificial Intelligence Lab. 
Corp. Source Codes: 002797310; 423400 

Sponsor: Air Force Office of Scientific Research, Boiling AFB, DC. 

Report No. : AFOSR-TR- 91-0680 

10 Jul 91 14p 

Languages: English 

Journal Announcement: GRAI9123 

Order this product from NTIS by:' phone at 1-800-553-NTIS (U.S. 
customers); (703)605-6000 (other countries,); fax at (703)321-854 7; and 
email at orders@ntis.fedworld.gov. NTIS is located at 5285 Port Royal Road, 
Springfield, VA, 22161, USA. 

NTIS Prices: PC A03/MF A01 

Country of Publication: United States 

Contract No.: AFOSR-8 9-027 7 ; 2304; A7 

An aspect graph plays an important role in three dimensional object 
recognition. Its represents the three-dimensional shape of an object by its 
two dimensional qualitative views as seen from various viewpoints. To 
create the aspect graph of an object, the viewpoint space is partitioned 
into regions, each of which corresponds to qualitatively similar 
projections of the object. Algorithms for creating aspect graphs of 
polyhedral objects have been developed. We developed an algorithm to 
compute the aspect graph of a curved object. Our approach partitions the 
viewpoint space by computing boundary viewpoints from the shape 

descriptions of the object given in a computer aided design database . 
These computations are formulated from the understanding of visual events 
and the locations of corresponding viewpoints. We also studied new visual 
events for piecewise smooth objects. 
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ABSTRACT: Due to the large size of modern databases , it might be useful 
to divide an entire database into smaller partitions. Three basic forms 
of partitioning exist: round robin, hash-based, and range. For specific 
purposes, hybrids of all three can also be made. Round robin, the easiest 
way to partition , guarantees almost equal partition size . With 
range partitioning , a certain defined range of a record value goes 
into the various data stores. Hash-based partitioning is a more abstract 
form of range partitioning. 
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Look Before You Leap. (Technology Information) 
SARADHI, VI JAY; SIMONEAU, MARTIN 
Intelligent Enterprise, 4, 3, 40 
Feb, 2001 

LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 2784 LINE COUNT: 00223 

...ABSTRACT: re-engineering. Managers must perform extensive evaluations 
of software, hardware and the overall data model before designing the ETL 
process. The next steps are to size the database and determine 
partitioning strategy. Building the actual prototype involves first 
identifying its focus and then creating database objects using the models 
and tools selected. A prototype should be populated with a good-size 
sample of real data if possible. Generating reports and running ad-hoc 
queries helps test a warehouse. The final step is to digest the results 
from. . . 
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PCI 2.1 compliant. The GFX-500F costs $119. 
SOFTWARE 

Candle's IntelliWatch Pinnacle 99 for Lotus Notes features new 
replication monitoring and assurance capabilities. Statistics measure 
replication performance at the individual database level. It also offers 
automatic detection, correction, and problem notification. IntelliWatch 
Pinnacle 99 costs $4,800 per single partitioned server. ... OnMark 2000 
Assess 4.0 from Viasoft scans PCs for Y2K issues in hardware/BIOS, 
applications, and data files. It automatically expands two-digit years in 
Excel spreadsheets and lets organizations scan compressed and archived 
databases for Y2K compliance. OnMark 2000 Assess 4.0 costs $4 9. 
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... of Red Brick Warehouse consists of three components: a database 

server, a load subsystem, and gateway technologies for client/server 
access . 

Red Brick's relational database server was designed to support 
databases typically larger than 500GB with billions of records. It uses 
compact representations for numeric data and compressed ... it employs 
parallel scanning, parallel joining, and trademarked technologies it calls 



parallel-on-demand and parallel SuperScan. Using parallel-on-demand, the 
Red Brick query analyzer partitions queries for the optimal degree of 
parallelism, where it considers the query's complexity, the tables' 
partitioning, and the available resources. For example, it allocates... 
...execution methods for the subsequent steps. In this way, it eliminates 
the perfor mance problems sometimes introduced when a query optimizer uses 
out-of-date statistics . 

Red Brick supports conventional B-tree, star, and target indexes for 
different types of queries. Star indexes are automatically built when 
tables are created -- they. . . 
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trees (hierarchical relationships between data values) . Celko shows 
how to solve problems that many people claim SQL is incapable of handling, 
such as calculating simple statistics (median, mode, variance, and 
standard deviation), running totals, rankings, and subsets (that is, top 
10), and cross tabulations. Most examples are based on ANSI/ISO SQL-92, but 
the author also discusses proprietary features in popular SQL DBMS 
products . 

This is definitely a tips and tricks book, but Celko never neglects to 
explain why some approaches work better than others. He frequently shows... 
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the data requirements are," Gold-Bernstein said. "Sometimes you 
cannot predict how people will randomly ask questions" of a database, for 
example . 

Sarma advocates a statistical approach. Compiling figures on 
database calls and physical I/Os gives managers a basis for assessing 
network load, he said. 

Application partitioning can also be a powerful tool for managing 
network load and making the best use of CPUs on both client and server 
machines. But Sarma. . . 
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Center, San Jose, CA 95120. 
Further Results on the Security of Partitioned Dynamic 
Statistical Databases Mary McLeish 

Partitioning is a highly secure approach to protecting statistical 
databases . When updates are introduced, security dependents on putting 
restrictions on the sizes of partition sets which may be queried. To 
overcome this problem, attempts have been made to add "dummy" records. 
Recent work has shown that this leads to. . . 
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ABSTRACT: Partitioning as a means of protecting statistical data bases 
is a highly secure approach. Maintaining security during updates requires 
restricting the sizes of partition sets which may be queried. Adding 
dummy records to overcome this problem has been shown to cause high 
information loss. A model is presented which... 

...and alternatives to adding dummy records presented. The security problem 
is examined, with if and only if conditions considered. Security is found 
to hold if partition sizes are kept even. The practical implications of 
this model for the database manager are considered. 
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TEXT: 

. . .pointers to custom written logic routines; descriptions of object 
behaviors and relationships between objects; input device bindings to 
actions; collision callback mapping; playback constraints; LOD definition 
; BSP partitions ; structural database definition ; and networking 
protocols. Activation's features can be accessed by designers through 
point-and-click mouse commands, or by programmers through taggable code 
assigned to . . . 

...for all leading game platforms, including Sony Playstation, Sega Saturn, 
Nintendo 64 and PCs running Microsoft DOS or Microsoft Windows. Included 
with Activation are three sample games which illustrate the breadth of 
the program's prototyping capabilities. The sample games are Race to Los 
Gatos, a 3D racing game; Mythology Fight, a 3D fighting title; and Space 



'Cadet, an action-oriented space battle game. 
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time graphic displays about transaction activity, CPU and device 
utilization, and network traffic. 

The SYBASE Configurator {TM ) product provides capacity planning and 
design modeling for database environments. The software analyzes user 
statistics , capacity requirements, application design information, and 
throughput requirements, and then recommends hardware configurations, 
selects database partitioning and estimates performance. Such 
capabilities are especially important for sites with massive amounts of 
data and high transaction and query volumes. The initial release of 
Configurator is... 
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...pointers to custom written logic routines; descriptions of object 
behaviors and relationships between objects; input device bindings to 
actions; collision callback mapping; playback constraints; LOD definition 
; BSP partitions ; structural database definition ; and networking 
protocols. Activation's features can be accessed by designers through 
point-and-click mouse commands, or by programmers through taggable code 
assigned to . . . 

...for all leading game platforms, including Sony Playstation, Sega Saturn, 
Nintendo 64 and PCs running Microsoft DOS or Microsoft Windows. Included 
with Activation are three sample games which illustrate the breadth of 
the program's prototyping capabilities. The sample games are Race to Los 
Gatos, a 3D racing game; Mythology Fight, a 3D fighting title; and Space 
Cadet, an action-oriented space battle game. . . 
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Accrual and cash flow accounting models : a comparison of the value 

relevance and timeliness of their components . 
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Nov, 1996 
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... 1,2,5 and 10 years are employed. 

Cash flow from operations and operating accruals data were generously 
supplied by Percy and Stokes (1992). Their sample comprised 107 firms for 
which the information required to calculate the cash flow from operations 
and operating accruals measures was obtained from the AGSM Annual Report 
File. This sample represents all of the firms existing in Australia from 
1975 to 1985 for which data are available on the AGSM database . The time 
period is comparable with the second half of EHO's sampling period which 
runs from 1976 to 1986. 

The ten year event window used in this study implies that a firm must 
have ten consecutive years... 
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... the data requirements are," Gold-Bernstein said. "Sometimes you 

cannot predict how people will randomly ask questions" of a database, for 
example . 

Sarma advocates a statistical approach. Compiling figures on 
database calls and physical I/Os gives managers a basis for assessing 
network load, he said. 

Application partitioning can also be a powerful tool for managing 
network load and making the best use of CPUs on both client and server 
machines. But Sarma... 
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. . . disincentives associated with retrospective payment systems (wage 

loss) . Those disincentives have been mentioned in the discussion of the 
preceeding paper. 

Under the suggest system, the data base of disability claims 

(excluding permanent total because of the small sample size ) would be 
partitioned according to major categories. The latter might be linked to 
the functioning of body organs. A distribution of lost work days would be 
estimated for . . . 
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... 5) More specifically, households whose heads were self-employed, 

retired, not working but not retired, or in the occupational category 
"other" were eliminated from the data base . Next, demographic classes 
were defined in order that the subsamples of households residing in a given 
state could be partitioned . Twelve classes were defined on the basis of 
marital status, education, and race. Any household that did not respond to 
these questions was removed from the sample . These deletions left a total 
of 9,242 households in 41 States to be partitioned. 

Once each state subsample had been partitioned, averages of 
disposable . . . 
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ABSTRACT: The structure of coauthor graphs and the statistical validity 
of the associated author partitions are investigated as a function of 
productivity and collaborative thresholds. The productivity threshold 
determines the number of authors (points) in a coauthor graph, and the 
collaborative threshold determines the number of coauthor pairs (lines) in 
the graph. The statistical validity of author partitions is determined 

by the random-graph hypothesis. The results show that for 'small' 
databases , statistically preferred partitions occur when all authors and 
coauthor pairs appear in the graph. For 'large' databases , statistically 
preferred partitions occur when authors and coauthor pairs who publish only 
one article are excluded from the graph. Unlike other bibliometric 
relationships, the highly. . . 

...the collaborative relationship produces a wide range of threshold values 
for which the associated partitions are statistically valid. It remains to 
be shown how the statistical validity of partitions is related to the 
empirical significance of the same partitions. (Reprinted by permission of 
the publisher . ) 



12/3, K/16 (Item 1 from file: 15) 

DIALOG (R) File 15 : ABI /Inform (R) 

(c) 2003 ProQuest Inf o&Learning , All rts. reserv. 
02139595 70047784 

Employment structure and training needs in the Louisiana value-added wood 
products industry 

Vlosky, Richard P; Chance, N Paul 

Forest Products Journal v51n3 PP: 34-41 Mar 2001 
ISSN: 0015-7473 JRNL CODE: FPJ 
WORD COUNT: 38 93 

. . .TEXT: in this study were conducted in accordance with well-documented 
and verified techniques (3,6,7, 10). The following sections describe these 
procedures . 



SAMPLING 



The sample frame for the study consisted of all secondary solid wood 
products manufacturers in Louisiana. Examples of industry sectors 
represented include hardwood dimension and flooring mills, wood kitchen and 
bath cabinets, wood household furniture, wood office furniture, store 
fixtures, pallets, partitions , etc. There are estimated to be 
approximately 650 companies in this population in Louisiana (12) . The 
primary source of sample frame information was existing industry 
directory databases and directories compiled by the LFPL (4). 

MAIL QUESTIONNAIRES 

Data collection was done using a mail survey questionnaire. Mail 
questionnaires were chosen as the most . . . 
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...TEXT: do not have incentives to forecast. Evidence regarding this 
conjecture is presented in the association tests that follow. 

TABLE 3 

Looking at subsamples of the data based on prior-year earnings 

performance reveals that the price-based forecast outperforms the 
financial-statement- analysisbased forecast in both partitions of the 
data. Even in the poor performance partition of the sample where 
transitory earnings are more prevalent, the pricebased forecast still 
appears to generate more accurate predictions. In addition, all three 
forecast sources produce smaller errors... 
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...TEXT: planners were analysed for both small and medium-sized firms. 
Other than for accounting packages, the results revealed that high planners 
in the small company sample used the various software packages to a 
significantly greater extent than low planners, particularly in respect of 
spreadsheets, databases , MIS and statistical packages. The mean usage 
scores for high planners in the small firm sub- sample for spreadsheets 
(4.24), databases (4.06), MIS (3.39) and statistical packages (2.94), 
were significantly higher than for low planners in the small company 
subsample -who had mean scores of 3.84, 3.52, 2... 
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...TEXT: Barniv and Hathorn (1997) on mergers and insolvency. 

Most life/health solvency studies have appeared after 1990 and also show a 
migration from matched-pairs samples to whole-industry analyses with the 
advent of the NAIC databases . Barniv and Hershbarger (1990) used 
matched-pair sampling of pooled data from 1975 to 1985 to correctly 
classify the insolvency status of between eighty-two and ninety-one percent 
of life insurers one and two years in advance. More recently, Ambrose and 
Carroll (1994) used matched-pair sampling of pooled data from 1969 to 
1986 to predict life insolvencies for 1987 to 1991. They attribute their 
finding of relatively low predictive power to temporal changes in the 
factors responsible for insolvency over long time spans. 3 Using the NAIC 
database for 1986 through 1991, Carson and Hoyt (1995) compared logistic 
regression, recursive partitioning , and discriminant analysis for 
predicting life insolvencies. Although they did not analyze segments, they 
conjectured that "models segregated by insurer size and product line also 
may yield additional... 
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...TEXT: pipeline parallelism. Collectively, the DB2 Family provides 
additional support for data warehouse performance and scalability: 

Parallel-aware, cost-based search optimizers that exploit a wide range of 
database statistics 

Intelligent partitioning 

Parallel database operations, including (but not limited to) parallel table 
and index scans, joins, backup/recovery, and utilities 

Specialized indexes and index processing 

SQL extensions. . . 
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ABSTRACT: In the relational database system, the join operation is one of 
the most important due to its frequent uses, especially if relations are 



normalized. Of several algorithms proposed for. . . 

, . , can be saved, compared with the sort-based algorithm. The main 
improvement of the hybrid algorithm comes from completely sorting only the 
smaller relation and partitioning the others into ranged buckets 

according to the order statistics of the sorted relation. In analyzing 
the performance of the hybrid join and comparing it to other methods, it is 
shown that the hybrid join. . . 
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ABSTRACT: The structure of coauthor graphs and the statistical validity 
of the associated author partitions are examined as a function of 
productivity and collaborative thresholds. The statistical validity of 
author partitions is determined by reference to the random-graph 
hypothesis. The results indicate that, for "small" databases , 
statistically preferred partitions occur when all authors and coauthor 
pairs appear in the graph. For "large" databases , statistically 
preferred partitions emerge when authors and coauthor pairs who publish 
only one article are excluded from the graph. Unlike other bibliometric 
relationships, the highly. . . 

. . . the collaborative relationship generates a wide range of threshold 
values for which the associated partitions are statistically valid. It 
remains to be demonstrated how the statistical validity of partitions is 
related to the empirical significance of the same partitions. ... 
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ABSTRACT: The sorting of large files of data derived from bibliographic or 
other textual data bases can be an expensive procedure. Therefore, any 
slight increase in the efficiency of sorts can contribute to reduction in 
costs to the users of information services. One method of sorting large 
textual data bases by computer uses external storage and divides the 

range of sort-keys in a sample of data to be sorted into a fixed set of 
partitions. The partitions are composed of ordered key ranges , and an 
incoming data stream is distributed into a series of bins according to the 
partition in which the key lies. The bins are then... 

... sort, to give an ordered file. The number of disc accesses needed 
depends on the manner in which the bins become filled and, thus, on 
statistics of the data. An experiment using an INSPEC data base 
suggests that this method of sorting is feasible and that it is possible to 
generate a partition set from a reasonably small sample of the data to be 
sorted. . . . 
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ABSTRACT: A national cartographic data base is needed which can readily 
produce computer mapping among organizations having much the same needs. 
Many data bases designed to support federal programs could be applied 
by other users. Such data needs to be made more accessible to possible 
users at all levels... 

. . . which use topological data to describe polygonal features, such as city 
blocks, could also be used to map such things as land use or health 

statistics . A topological data base would have wide applicability. 
The capability to merge 2 cartographic data files into one data base 
has also been developed. Further work should also be done on merging a 
gridded data base with the topological data structure. A gridded data 
base is used in cases where data cannot be partitioned into exactly 

defined boundaries , such as in the case of rainfall. 



'V 15/3,K/1 (Item 1 from file: 275) 

/•SDIALOG(R) File 27 5: Gale Group Computer DB(TM) 
v ^ifc) 2003 The Gale Group. All rts. reserv. 

02483843 SUPPLIER NUMBER: 70909008 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Look Before You Leap. (Technology Information) 
SARADHI, VI JAY; SIMONEAU, MARTIN 
Intelligent Enterprise, 4, 3, 40 
Feb, 2001 

LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 2784 LINE COUNT: 00223 

...ABSTRACT: re-engineering. Managers must perform extensive evaluations 
of software, hardware and the overall data model before designing the ETL 
process. The next steps are to size the database and determine 
partitioning strategy. Building the actual prototype involves first 
identifying its focus and then creating database objects using the models 
and tools selected. A prototype should be populated with a good-size 
sample of real data if possible. Generating reports and running ad-hoc 
queries helps test a warehouse. The final step is to digest the results 
from. . . 
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... task in a similar fashion. When you launch either program, it 

allocates as much memory as possible for a packet buffer. The buffer size 
is determined by the size of the MultiFinder memory partition and can 
be as large as you want. When you start collecting packets, both programs 
display statistics and charts of the number of packets captured, errors 
found and the amount of network bandwidth being used. 
Packets are displayed in a main window... 
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...ABSTRACT: The primary design objective of this controller is to 
maximize the throughput of the signal processing module as well as 
controller task timing and queueing statistics . The model is 
parameterized to allow sensitivity analysis of functional components, 
task partitioning , queue sizing and data input rates. (Reprinted by 
Permission of Publisher.) 
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gave different results and hence "a specified procedure with 
pre-determined calibration curves has to be followed in order to obtain 
reliable and reproducible results". 

Samples of the PVC films were contacted with different 
concentrations of PVC in the water or oil (in the ranges 50-200, 30-105, 
and 20-50 ppb) and the system stirred until equilibrium was reached, when 
the liquid phase was analysed. VCM in the polymer was estimated by 
difference . 

Partition coefficients (polymer to liquid ranged from ca 1 to 8 
for the corn oil, and 6 to 40 for the water, increasing generally (not 
invariably) with reducing VCM concentration. 
The. . . 



15/3, K/5 (Item 1 from file: 16) 

DIALOG { R) File 16: Gale Group PROMT (R) 

(c) 2003 The Gale Group. All rts. reserv. 

09051356 Supplier Number: 78932836 ( USE FORMAT 7 FOR FULLTEXT) 
Advances in Emulsion Polymerization For Coatings Applications: Latex Blends 
And Reactive Surfactants. 

El-Aasser, Mohamed S.; Tang, Jiansheng; Wang, Xiaoru; Daniels, Eric S.; 

Dimonie, Victoria L . ; Sudol, E. David 

The Journal of Coatings Technology, v73, n920, p51 

Sept, 2001 

Language: English Record Type: Fulltext 
Document Type: Maga zine/ Journal ; Trade 
Word Count: 9057 

homopolymer, or copolymer) after the polymerization. Incorporation 
is favored at lower surfactant and higher initiator concentrations. These 
results are not unexpected. 

SURFACE vs. BURIED: The sample prepared with 30 mM reactive 
surfactant and 8 mM ( Na . sub . 2 ) ( S . sub . 2 ) (0 . sub . 8 ) was subjected to further 
analysis to determine the extent of partitioning of the surfactant 
between the surface and interior of the latex particles. By partially 
swelling the particles with THF, ion exchange and titration of the. . . 
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. . . of more statistically sound measures (see Barber and Lyon (1997) 

and Kothari and Warner (1997)). These better measures include the 
size-adjusted returns we provide. 

Size -adjusted returns are calculated by partitioning the 



highly shorted sample and all Nasdaq stocks into market value deciles at 
the time of each of the 55 announcements. Net-of-size portfolio returns are 
calculated as . . . 



15/3, K/7 (Item 3 from file: 16) 

DIALOG ( R) File 16: Gale Group PROMT (R) 

(c) 2003 The Gale Group. All rts. reserv. 

06298040 Supplier Number: 54488438 (USE FORMAT 7 FOR FULLTEXT) 
Novel screening unit provides alternative to conventional shale shaker. 
Dehn, Courtney 

The Oil and Gas Journal, v97, nl5, p40(l) 
April 12, 1999 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal ; Trade 
Word Count: 4 601 

... in the feed . 

As it is virtually impossible to measure the flow rates of the feed 
(undersize and oversize streams in real time operations), the partition 
numbers for the various size fractions must be determined from sample 
data gathered in a steady state for the three streams using an analytical 
equation . 

The undersize and oversize partition numbers are derived in Equations 

2. . . 
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p. sub. j) ( (X.sub. ji) ) , (17) 

where (X.sub.ji) are independent draws from (p. sub. j) and the 
subscripts on I denote the partition of unity and the sample sizes 
used. The estimate (I.sub.n, w) is unbiased under mild conditions on the 
supports of the function (p.sub.j) and (w.sub.j). 

Veach and Guibas... 
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... and 20.7 percent in BH terms in the fourth year. (We do not use 

four-year returns elsewhere in the paper.) 

Panel C considers sample partitions by size, book-to-market ratios, 
and time-period. The size partition is based on market capitalization 
measured at the time of the first financial statement post-IPO with 
cutoffs at $20 million and $100 million (in December 1997 real dollars) . 
The differential. . . 
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... sub.it) in response to a change of one standard deviation in the 

value of this explanatory variable. A comparison of this change to the 
size of the partitions provides a measure of the economic importance 
of a variable. As a further aid in interpreting the probit model, Table III 
contains descriptive statistics of the distributions of the explanatory 
variables by rating category and overall. 

The variance of the standard errors of the probit model, which can be 
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... is provided in Section 3, Example 1. 

2.4 The Power of the Test 

The power of the test depends on the accuracy of the partition 
boundaries as estimates of "true" boundaries , the difference in 
intensities (Lambda) and (Lambda) (prime ) , and the size of the region of 
nonhomogeneity (R.sup.A). In the example considered herein, the... 

...than the healthy tissue, then this partitioning scheme has the potential 
to yield greater power than a standard quadrat test or a conventional 
spatial scan statistic . The power of the test based on (2) can be 
calculated exactly. In particular, we consider the best-case scenario in 
which (R.sup.A. . . 
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the market value effects of advertising and R&D from other 
industry-specific valuation considerations. In addition, the basic 
valuation model can be analyzed over samples of advertising-intensive and 



R&D-intensive industries to learn the extent to which the valuation effects 
of advertising and R&D expenditures are mitigated by substantial 
promotional and innovative activity by competitors. By analyzing the 
overall sample of firms, in addition to a simple two-part breakdown for 
manufacturing versus nonmanuf acturing firms, it becomes possible to learn 
the extent to which expenditures... 

...and R&D have broad rather than narrow implications for the value of the 
firm. By considering the market value implications of a three-part sample 

partition according to firm size ( measured by sales revenue), the 
extent to which firm size plays a role in determining the market value 
effects of advertising and R&D can also... 
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... that this condition guarantees uniqueness. 

Andreatta and Kaufman (1986) adapt Murthy's (1957) estimator, a close 
relative of Horvitz and Thompson's estimator, to successive sampling in a 
different way. If any one population characteristic such as the number N of 
deposits, the sum of all deposit magnitudes or a fractile. . . 

. . . in-place deposits is assumed to be known with certainty, then this 
knowledge be used to compute an estimate of inclusion probabilities from an 
incomplete sample of the population. They call this "anchored 
estimation," the known population characteristic being the "anchor." An 
application to North Sea data partitioned into seven size classes 
recovers MLE estimates for each of these size classes so closely as to 
suggest a tight link between conditional (on the anchor) MLE and unbiased 
estimation via anchoring... 
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...TEXT: both subperiods, mimicked the earlier results. Thus, the changed 
relationship between return and beta was not driven by a sector effect. 

Does size matter? To assess the potential confounding effect of size , 
we partitioned stocks into three size groupings based on market 
capitalization in each June. The "large" stock group comprised the largest 
100 stocks; the "medium" group, the next largest 200 stocks; and the 
"small" group, the remaining stocks. The mean monthly return for the entire 
sample period varied significantly with size: Large stocks averaged 1.24 
percent; medium stocks averaged 1.07 percent; and small stocks averaged 
2 . 35 percent a . . . 
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...TEXT: from studies that examined multiple products. When subjects or 
dependent variables differed, we calculated separate effect size estimates 
across products. The independent variable used to partition studies for 
calculating the effect size estimates was ad format (comparative ad 
versus noncomparative ad) . Comparative ads were those that explicitly or 
implicitly compared the sponsor's brand with another brand in... 

... three ad levels (e.g., direct comparative, Brand X, noncomparative), we 
used direct comparative versus noncomparative ads to calculate the effect 
size estimate. The d- statistic was coded positive if the comparative ad 
produced more favorable results than the noncomparative ad and negative 
otherwise . 

Moderating variables were included in our analysis. . . 
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...TEXT: the New Zealand audit market, but the interactive nature of the 
relationships among audit firm size, company size, and company listing 
status require a further analysis of the sample partitioned on 
company size (large vs. small) and listing status (listed vs. unlisted). 
The partitioned regression results demonstrate that the Big 5 received fee 
premiums from large listed and. . . 
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...TEXT: size than the incumbent auditor, relative to clients that change 
auditors with no disagreement disclosed. Table 8 presents data regarding 
the incumbent and new auditor partitioned by two different measures of 
size : (1) Big 8 versus Non-Big 8; and (2) total audited sales. (13) (Table 
8 omitted) For the Big 8 versus Non-Big 8 classification. . . 

... to a Big 8 auditor (22 percent versus 11 percent) , a chi sup 2 test on 
the two-by-two contingency table including the 28 sample firms changing 
to different size auditors indicates an insignificant difference between 
the switch behavior of the D group and the ND group. When the Big... 
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. . .ABSTRACT: which use topological data to describe polygonal features, 
such as city blocks, could also be used to map such things as land use or 
health statistics . A topological data base would have wide applicability. 
The capability to merge 2 cartographic data files into one data base has 
also been developed. Further. . . 

... be done on merging a gridded data base with the topological data 
structure. A gridded data base is used in cases where data cannot be 
partitioned into exactly defined boundaries , such as in the case of 
rainfall . 



/ 17/3, K/l (Item 1 from file: 621) 

DIALOG (R) File 621: Gale Group New Prod.Annou. (R) 
(c) 2003 The Gale Group. All rts. reserv. 

02244292 Supplier Number: 57830035 (USE FORMAT 7 FOR FULLTEXT) 
CybeRecord Successfully Tests Automatic Document Image Recognition 
Software . 

Business Wire, p0199 
Nov 30, 1999 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 4 60 

... Inc. {Nasdaq : CYRD) announced today that it has successfully tested 

automatic image recognition features of its digital document processing 
software on a large pre-scanned sampling of diverse microfilm formats. 

The software's innovative statistical modeling algorithms 
automatically locate individual image boundaries on the scanned microfilm 
and partition the digital file into standardized pages, eliminating 
excess data. The company is developing an automatic image recognition, 
enhancement, and restoration solution that is essential for... 
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firms from Canada; and 
8) (m.sub.j) is the disturbance term for firm j in year t. 
Results 

Summary Statistics 

Table Al presents summary statistics of the measuring variables. 
Panel A provides summaries of variables for the total sample , while 
Panels B, C, and D partition the statistics by region and year of 
study. From Panel A, the statistics reveal that the mean market reaction 
(INFOann) to foreign earnings announcement is 1.98 percent of the 
security's price and 1.04 percent during... 
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encoded using only simple sufficient stat istics--thus allowing 
models to be updated without the need to rerun past data. Note that the 
number of sufficient statistics for each partition is a linear 
function of its cardinality, which is typically a small fraction of the 
sample size. Thus the notion of interruptibility can be applied when n is 
large . 



APPENDIX A: APPROXIMATE WEIGHTED CHINESE RESTAURANT ALGORITHM FOR 
SINGLE-MEASUREMENT DATA 
Plug. . . 
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branches in the sample is comparable to the OECD figures for 
Belgium, Germany, and France, while, for the other countries, larger banks 
are slightly over sampled . 

Table 2 provides some statistics on the partition of the 
observations in the sample according to branching behaviour. It appears 
that, within the set of multi-branch banks, the majority (2565) opened new 
branches or kept their network size... 
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... divide the sample into scanners and non-scanners. Firms scoring 

above the mean were classified as scanners, those below the mean as 
non-scanners . 

The statistical methods used in testing the hypotheses were 
analysis of variance (ANOVA) and t-tests. ANOVA was used to partition the 

sample into scanners and non-scanners on each scanning index across the 
growth and maturity stages of the industry life cycle. (Growth and maturity 
were the . . . 
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n - 3 Years 
Intercept (( (alpha ). sub . 0) ) 
(t-statistic ) 



-1.66 (***) 
(-22.55) 



(FE.sub.t-n) ({ (alpha) .sub. 1) ) 0.12 (**+) 

(10.09) 

F Statistic 102 

Adjusted (R.sub.2) 0.011 
Sample Size 9,411 
(***.) Significant at the 0.01 level. 

Descriptive Statistics for Partitions of Lagged Residual Forecast 
Error (partitioned by thirds) 

(FE ' . sub . j t-n) the forecast error for firm j (i.e., ( (E. sub. 1) - (F. sub 
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... non- Januaries during 1975-84, and non- Januaries during 1985-92. 

Results indicate market beta is not priced for Canadian stocks in any 
of the sample partitions . None of the t- statistics is significant for 
the beta risk premia. In fact, the risk premia for beta are negative, 
(although insignificant) for most of the partitions presented in... 

...S. markets over various subperiods . However, they do not provide 
contrasts between January and non- January. 

Firm size effects for Canadian stocks exist in each sample 
partition presented in Table 3. All of the t- statistics for the 
size-risk premia are significant. Findings indicate the firm size effect is 
noticeably stronger in January than in non-January. For example, over 
cross-sectional regressions and the respective t- statistics are 

reported for each sample partition . The first column presents the 
beta risk premia and respective t- statistics and the second column 
presents the size risk premia and respective t-statist ics . 

** Significant at the 5 percent level. 
Our tests show firm size effects... 
1975-84, 1985-89, January only, and non-January 

months. The averages of the gammas estimated from the 
cross-sectional regressions and the respective t- statistics are 
reported for each sample partition . The book-to-market risk premia 
and corresponding t- statistics are presented in the first column The 
size risk premia and corresponding t-statistics are presented in the 
second column. 

Significant at the 5 percent... 
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... the radius (Sigma) of the median filter for larger target signals. 

This independence assumption is key to the derivation in Section 2.2 of the 



sampling distribution of the spatial scan statistic based on the 
stochastic partition . The effect of this presmoothing on the partition 
(W. sub. (Sigma) )( (Zeta) ) is investigated in Section 3.1. 

In summary, given a realization (Zeta), W((Zeta)) = {(R.sub.l), 
(R.sub.K) } produces. . . 
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188-205. 

(1986b), "Combining Minimax Shrinkage Estimators," Journal of 

the American Statistical Association, 81, 437-445. 

Geyer, C. J. (1992), "Practical Markov Chain Monte Carlo," 
Statistical Science, 7, 473-511. 

Hartigan, J. A. (1990), " Partition Models," Communications in 
Statistics , Part A - Theory and Methods, 19, 2745-2756. 

Hastings, W. K. (1970), "Monte Carlo Sampling Methods Using Markov 
Chains and Their Applications, " Biometrika, 87, 97-109. 

James, W., and Stein, C. (1961), "Estimation With Quadratic Loss," in 
Proceedings of the... 
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... it difficult for them to retain their position. Also bidders in 

rejected offers are performing insignificantly better than the bidders 
whose offers are accepted (t- statistic - 1.09). 

Panel D of Table 2 partitions the sample according to the form of 
payment. It shows target firm's performance prior to the bid is more 
negative when equity is the form of . . . 
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... 60 43 4 4 

1986(*) 32 26 2 1 



Total 800 579 375 191 

* The number of 1986 issues is small because the Registered 
Offering Statistics tape available for this project does not 
include all 1986 registrations. 

TABLE 3 Sample Size of Each Portfolio Partition 

A. Common Stock Offers 

D Precedes B D Follows B Row Total 

X precedes B Portfolio 1 Portfolio 2 

75 38 113 

(39.27%) (19... 
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... Roley's analysis covers six years (1977-1982) with 27 changes, and 

Hafer studies the 1977-1984 period with 32 changes. In addition, each study 

partitions the sample into different monetary policy regimes. The 
resulting small sample sizes diminish the chances of finding statistical 

significance even if economic significance exists. 

5 As indicated by Waud (1970), frequently the New York Fed Bank will 
either lead or lag changes made... 
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... size quintile into quintiles according to UEARN . Second, in 

quintiles according to UEARN then within each earnings quintile according 
to SIZE . For each of these partitions we use a statistic based upon 
Jonckheere [1954] which tests for a k- sample trend against ordered 
alternatives .( 9) For a given size partition , we expect abnormal returns 
to increase as unexpected earnings increases leading to a significant 
positive TJ statistic across earnings quintiles. These results are 
presented in. . . 
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...TEXT: for this purpose, and this forces us to use cross-sectional data 
to quantify the agerelated differences in inequality. 

Specifically, we do the following: we partition the SCF sample into 10 
cohorts according to the age of the household heads, we compute the 
relevant statistics for each cohort, and we compare them with the 
corresponding statistics for the entire sample. These statistics are the 
cohort average earnings, income, and wealth... 
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...TEXT: and growth are systematically invoked by analysts when summarizing 
the investment potential of stocks. 

In Panel B of Table 3, the second and third columns partition the sample 
based on whether a target price is disclosed, and the final two columns 
present a chi 

sup 2 

statistic and p-value for whether the distribution of each justification 
differs between the two subsamples . Generally, there are two significant 
differences between the reports that... 
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...TEXT: non-Big 6 auditors. The data also indicate that the specialists 
audit the larger insurers (74.6 percent of net premiums written). 

Table 2 reports sample descriptive statistics by audit firm type. 
Univariate t-tests, Chisquare tests, or Wilcoxon rank-sum tests are 
performed across the partitions as a preliminary analysis. Table 2 
indicates that the variable of interest OUTBD, measured as the percentage 
of outside directors on the board of directors... 
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ABSTRACT : Is the Ewens distribution the only one-parameter family of 
partition structures where the total number of types sampled is a 
sufficient statistic ? In general, the answer is no. It is shown that all 
counterexamples can be generated via an urn scheme. The urn scheme need 
only satisfy. , . 
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...TEXT: and nonstressed firms (replicating Hopwood et al.) and between 
firms undergoing recession and those not undergoing recession. 



This result is further refined by dividing the sample into the four 
partitions described earlier and testing for incremental improvements 
from stress and recession knowledge. The lambda statistic for the 
difference between the unconditioned and the recession- and stressed- 
controlled model -2LogL measures is 126.783 (24 df ) , and is also 
statistically significant... 



17/3, K/19 (Item 6 from file: 15) 

DIALOG ( R) File 15 : ABI/Inf orm (R) 

(c) 2003 ProQuest Inf o&Learning . All rts. reserv. 



01318293 99-67689 

The effects of cross-sectional scale differences on regression results in 
empirical accounting research 

Barth, Mary E; Kallapur, Sanjay 

Contemporary Accounting Research vl3n2 PP: 527-567 Fall 1996 
ISSN: 0823-9150 JRNL CODE: CAR 
WORD COUNT: 15195 

...TEXT: the sample median and zero otherwise. Untabulated findings reveal 
a significantly positive coefficient on SALESDEPR-c ' s estimate is 173.34 
with a White t- statistic of 5.65. 

To use Barth (1994) for illustrating the diagnostic, we partition Barth ' s 
1989 sample based on a scale proxy, book value of equity for the 
investment securities regressions, and net income for the securities gains 
and losses regressions. Regression... 
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...TEXT: and college training) were not included in any of the final models 
because of insufficient data, or were eliminated because the analyses 
showed no apparent statistical relationship at any reasonable level for 
any factor or respondent subset. Area of employment is used to partition 
the sample into subgroups for further analysis. Variable X sub 5*6 , 
which we label "employer support", is the interaction between encouragement 
and training. Model parameters are... 
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WORD COUNT: 8095 

...TEXT: group omega sub i if x is in the region Omega sub i . For the 
financial distress case, the estimation problem is to identify a 
statistical model that best partitions the sample space of companies' 
financial ratios into the two groups: FIC and nonFIC. 

We also confine ourselves to the Bayes minimal risk decision rule, which is 
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...TEXT: tests do reject the hypothesis that the coefficients are the same 
in the two sets of countries and the level of real GDP chosen to partition 
the sample was that which maximized the F statistic in this test. (10) 
These results suggest that the significantly negative impact of a larger 
government share is confined to the high income countries. For... 
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...ABSTRACT: based on the use of binary search trees for tree partitioning. 
With the new method, incoming files are decomposed into partitions with 
binary trees until partitions reach a manageable size for internal 
sorting. The search tree is generated by deriving statistics from a 
small sample of the data to be sorted. The method should be applicable to 
any data characterized by some degree of regularity, such as bibliographic 
and natural ... 
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Fulltext Availability: 
Detailed Description 
Claims 

Fulltext Word Count: 14 938 
English Abstract 

A training database (including data mining algorithm descriptions and 
meafeatures characterizing probability density functions of features) in 
the memory and computer readable program code (1) to extract features 
that classify data, (ii) to calculate metafeatures describing the case 
probability density function, and (iii) to select a data mining algorithm 
by using the training database to map the calculated metafeatures 
describing the case probability density funciton to the selected data 
mining algorithm. The frequeny of the occurrence of features with respect 
to datum in teh data defuing a case probability density function. 

French Abstract 

L * invention concerne une base de donnees d ' entrainement (comprenant des 
descriptions et des meta-element s d' algorithme d ' exploration en 
profondeur de donnees caracterisant des densites de probabilite 
d' elements) logee en memoire, et un code de programme lisible par 
ordinateur destines a: (i) extraire des elements de classement des 
donnees; (ii) calculer des meta-elements decrivant la densite de 
probabilite du cas; iii) choisir un algorithme d ' exploration en 
profondeur de donnees en utilisant la base de donnees d ' entrainement pour 
mapper les meta-elements calcules decrivant la densite de probabilite du 
cas relativement a 1' algorithme d 1 exploration en profondeur de donnees 
choisi. La frequence d' occurrence des elements par rapport a la reference 
dans les donnees definit une densite de probabilite du cas. 

Legal Status (Type, Date, Text) 

Publication 20020919 Al With international search report. 

Examination 20030109 Request for preliminary examination prior to end of 

19th month from priority date 



Fulltext Availability: 



Detailed Description 



Detailed Description 

this embodiment may also identify a point of diminishing returns in 
the number of features and estimate feature robustness. The computer 
readable program code to estimate feature robustness may also 
partition the data into subsets, temporally, sequentially, randomly, or 
otherwise. The computer readable program code to estimate feature 
robustness in this embodiment may then calculate the entropy of each 
subsetas a statistical measure of similarity. The computer readable 
program code in this embodiment may also identify parameters (such as 
user preferences, real-time deployment issues, available memory. . . 
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Claims 
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English Abstract 

A method of managing storage in a document image database (14) using 
document analysis (32) to partition documents into logical regions 
and modified by reducing storage size of the regions using different 
reduction modifiers according to various storage preference rules (78) . 
Storage preference rules are intended to maintain high quality 
representations of important document information while reducing storage 
requirements at the expense of lesser important aspects of the document. 
In particular, the different reduction modifiers (34) applied to stored 
document images include reducing sampling depth, reducing sampling 
resolution based on minimum font size, utilizing lossy and lossless 
compression schemes and discarding unimportant regions of document image. 
Over time, document analysis and modification can be repeated to further 
reduce the storage size of previously stored data files (50, 52, 54). 

French Abstract 

L* invention concerne un procede permettant de gerer la memoire d'une 
base de donnees d'imagerie documentaire (14) en utilisant 1' analyse de 
documents (32) . Celle-ci permet de diviser les documents en regions 
logiques et de les modifier en reduisant leur volume de memoire. Cette 
derniere operation se fait a l'aide de modif icateurs de reduction et 
selon differentes regies pref erentielles de stockage (78) . Ces dernieres 
ont pour but de conserver la haute qualite des representations de 
documents importants tout en reduisant les exigences de stockage aux 
depens de certains aspects des documents qui presentent une moindre 
importance. En particulier, les differents modificateurs de reduction 
(34) appliques aux images stockees comprennent la reduction de la densite 
d' echantillonnage, la reduction de la resolution d ' echantillonnage basee 
sur la police minimum, 1 ' utilisation de procedes de compression a pertes 
et sans pertes et 1' exclusion des regions des documents qui ne presentent 



pas d 1 importance . Par la suite, 1' analyse de documents et la modification 
peuvent etre repetees afin de reduire a nouveau le volume de memoire des 
fichiers precedemment stockes (50, 52, 54). 

English Abstract 

A method of managing storage in a document image database (14) using 
document analysis (32) to partition documents into logical regions 
and modified by reducing storage size of the regions using different 
reduction modifiers according to various storage preference rules (78) . 
Storage . . . 

...requirements at the expense of lesser important aspects of the document. 
In particular, the different reduction modifiers (34) applied to stored 
document images include reducing sampling depth, reducing sampling 
resolution based on minimum font size, utilizing lossy and lossless 
compression schemes and discarding unimportant regions of document image. 
Over time, document analysis and modification. . . 
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ABSTRACT EP 992909 A2 

A method and apparatus for distributing computer resources in a network 
environment. A network of computer systems is partitioned into at least 
one computing system partition , and is configured into at least one 
redundancy group. The computing system partitions include 
applications, computing system nodes, and copies of a database 
schema. The copies of the database schema are replicated at each 
computing system partition within a network. The computing system 
partition manages interactions between the instances, the computing 
system nodes, and the copy of the database schema within the 
respective computing system partition . The redundancy group comprises 
at least one computing system and at a plurality of computing system 
partitions , and manages the replication of the database schema 
within the computing system and computing system partitions . 

ABSTRACT WORD COUNT: 126 

NOTE: 

Figure number on first page: NONE 

LEGAL STATUS (Type, Pub Date, Kind, Text) : 
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at least one computing system partition , including at least one 
instance of an application, at least one computing system node, and 
at least one copy of a database schema, the copies of the 
database schema being replicated at each computing system 
partition within a network, and wherein each computing system 
partition manages interactions between the instances, the computing 
system nodes, and the copy of the database schema within the 
respective computing system partition ; 

a plurality of computing systems connected together via the network, 
wherein each computing system comprises one or more computing system 
partitions; 

at least one redundancy group, comprising at least one computing system 
and a plurality of computing system partitions , wherein each 
redundancy group manages the replication of the database schema 
within the computing system and computing system partitions 
within the redundancy group. 
2. The system of claim 1, wherein the redundancy group defines a first 
computing system as the computing system that replicates... 

...10. A method for distributing computer resources in a network 
environment, comprising the steps of: 
assembling, as part of a computer network, at least one computing 
system partition , including at least one instance of an 
application, at least one computing system node, and at least one 
copy of a database schema, the copies of the database schema 
being replicated at each computing system partition within the 
computer network; 
configuring, within the computer network, a plurality of computing 
systems connected together via the computer network, wherein each 
computing system comprises... 

. . .method of claim 10, wherein the task is a database replication within 
the computer network. 

12. The method of claim 11, wherein the task of database replication 
is performed by a first computing system partition within the 
redundancy group, 

13. The method of claim 12, wherein the task of database replication 
is performed by a second computing system partition within the 
redundancy group when the first computing system partition is 
unavailable . 

14. The method of claim 10, wherein the redundancy group can be redefined 
to include a different set of computing systems. 

15 . The. . . 

. . .method for providing database access, comprising the steps of: 

operating at least one computing system within a network, the computing 
system containing at least one computing system partition and the 

computing system being a member of a redundancy group, wherein the 
computing system partition includes at least one instance of an 
application, at least one computing system node, and at least one 
copy of a database schema, the copies of the database schema 
being replicated at each computing system partition within a 
network; and 

managing the replication of the database schema within the 

computing system and computing system partitions within the 
redundancy group. 
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ABSTRACT EP 990986 A2 

A method and apparatus for automatically redistributing tasks to reduce 
the effect of a computer outage on a computer network. The apparatus 
comprises at least one redundancy group comprised of one or more 
computing systems, comprised of one or more computing system 
partitions . The computing system partition includes copies of a 
database schema that are replicated at each computing system 
partition . The redundancy group monitors the status of the computing 
systems and the computing system partitions , and assigns a task to 
the computing systems based on the monitored status of the computing 
systems . 

ABSTRACT WORD COUNT: 94 
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. . .ABSTRACT outage on a computer network. The apparatus comprises at least 
one redundancy group comprised of one or more computing systems, 
comprised of one or more computing system partitions . The computing 
system partition includes copies of a database schema that are 
replicated at each computing system partition . The redundancy group 
monitors the status of the computing systems and the computing system 

partitions , and assigns a task to the computing systems based on the 
monitored status of the computing systems. 

...SPECIFICATION in a failure recovery system, characterized by: 

one or more computing systems connected together via a network, 
wherein each computing system comprises one or more computing system 
partitions each including at least one copy of a database schema, 
the copies of the database schema being replicated at each 
computing system partition within a network; 

at least one redundancy group comprised of the computing systems and 
the computing system partitions, wherein each redundancy group monitors a 
status . . . 

...from a computer failure, characterized by the steps of: 

operating one or more computing systems within a network, the 
computing systems comprising one or more computing system partitions 
each including at least one copy of a database schema, the copies 



...accordance with the present invention comprises at least one redundancy 
group comprised of one or more computing systems, which are comprised of 
one or more computing system partitions . The computing system 
partition includes copies of a database schema that are replicated 
at each computing system partition . The redundancy group monitors the 
status of the computing systems and the computing system partitions 
, and assigns a task to the computing systems based on the monitored 
status of the computing systems. 

The foregoing description of the preferred embodiment of. .. 

CLAIMS 1. A failure recovery system, characterized by: 

one or more computing systems connected together via a network, wherein 
each computing system comprises one or more computing system 
partitions each including at least one copy of a database 
schema, the copies of the database schema being replicated at 
each computing system partition within a network; 

at least one redundancy group comprised of the computing systems and the 
computing system partitions, wherein each redundancy group monitors a 
status . . . 

...from a computer failure, characterized by the steps of: 

operating one or more computing systems within a network, the computing 
systems comprising one or more computing system partitions each 
including at least one copy of a database schema, the copies of 
the database schema being replicated at each computing system 
partition within a network; 

configuring the computing systems into at least one redundancy group; 

monitoring a status of the computing systems and the computing system 
partitions within the redundancy group; and 

assigning. . . 

...computer network, characterized by the steps of 

operating one or more computing systems within the computer network, 

wherein the computing system includes at least one computing system 

partition , the computing system partition having at least one 
copy of a database schema; 
configuring the computing systems together via the computer network; 
configuring, within the computer network, at least one redundancy group, 
comprising one or more computing... 
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..SPECIFICATION support a greater load. In this case, the RDBMS access 
partition should run on each node where an RDBMS engine is located. Forte 
supports the replication of this RDBMS access partition and provides 
a router to allow multiple RDBMS engines to service the next RDBMS 
request in the application queue. The other reason for... 

..back-up node. For example, a node may provide a key analytical service 
or image for the user. In these cases it is possible to define a 
replicated partition as an alternate node that can be accessed in the 
event that the primary node is unavailable. Forte also provides a router 
that can access . . . 
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Fulltext Availability: 

Detailed Description 

Claims 

Fulltext Word Count: 5332 
English Abstract 

Systems and methods for automatically replicating database information. A 
subscription database (110) is queried by a database replication server 
(120) to obtain custom attributes defined in a plurality of custom 
database information subscriptions (115). The custom attributes for each 
database information subscription (115) include: (a) the identification 
of one or more master subscription databases (130), and (b). at least one 
operation to be performed on the one or more master subscription database 
to create a custom information database. The operation (s) to be performed 
can include merging database information contained in records (135) from 
two or more master subscription databases (130), partitioning database 
information contained in one or more master subscription databases (130) . 
The operations identified by the custom attributes for each custom 
database information subscription (115) are used to automatically 
generate custom information databases (140) containing preferred database 
information from the master subscription databases (130). 

French Abstract 

L' invention concerne des systemes et des procedes permettant de repliquer 
automatiquement des informations de bases de donnees . Un serveur de 
replication (120) de base de donnees demande a une base de donnees 
d'abonnement (110) d'obtenir des attributs personnalises definis dans une . 
pluralite d ' abonnements d ' informations (115) de base de donnees 
personnalisee . Les attributs personnalises de chaque abonnement 
d' informations (115) de base de donnees comprennent a) 1 ' identification 
d'au moins une base de donnees (130) d'abonnement maitre, et b) au moins 
une operation a executer sur la base de donnees d'abonnement maitre afin 
de creer une base de donnees d ' informations personnalisee. La ou les 
operation (s) a effectuer consistent a fusionner les informations de base 
de donnees contenues dans des enregistrements (135) a partir d'au moins 
deux bases de donnees (130) d'abonnement maitres, et a partitionner les 
informations de base de donnees contenues dans la base de donnees (130) 
d'abonnement maitre. Les operations identifiees par les attributs 
personnalises pour chaque abonnement d ' informations (115) de base de 
donnees personnalisee sont automatiquement utilisees pour creer des bases 
de donnees (140) d 1 informations personnalisees contenant des informations 



de base de donnees preferees provenant des bases de donnees (130) 
d ' abormement maitres. 

Legal Status {Type, Date, Text) 

Publication 20020117 A2 Without international search report and to be 

republished upon receipt of that report. 

Examination 20020801 Request for preliminary examination prior to end of 

19th month from priority date 

Main International Patent Class: G06F-017/00 
Fulltext Availability: 

Detailed Description 

Claims 

Detailed Description 

... Next, in Step 370, it is determined from the custom attributes whether 
the custom database information subscription requires a partition 
operation. If so, a partitioned copy of the identified Master 
Subscription Database 130 is created (Step 375) . Regardless of the need 
for partitioning , it is also determined whether the custom database 
information subscription requires a merging operation (Step 
380). If so, the identified databases are merged (Step 285); the merged 
databases can. . . 

...can also identify one or more filtering operations to be performed on a 
database (Step 0 380) . If a filtering operation is identified, the local 
copy of the identified Master Subscription Database 130 is filtered 
using the specified criteria (Step 395) ; the resulting database can be 
stored in a local database and/or in a publication database... 

Claim 

... of said 

custom database information subscriptions by performing said at least one 
operation on said database information stored in said one or more master 
subscription databases . 

10 The database replication server recited in Claim 9, wherein said 
merging, partitioning and filtering operations are definable on a 
record or record field basis 

11 The database replication server recited in Claim 9, wherein said act 
of 

generating said custom information database... 
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Main International Patent Class: G06F-017/30 
Publication Language: English 
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Fulltext Availability: 

Detailed Description 

Claims 

Fulltext Word Count: 17030 
English Abstract 

To facilitate accurate document searching, electronically accessible 
documents are provided with abstracts written in a highly constrained 
artificial grammar. The artificial grammar is capable of expressing the 
thoughts and information ordinarily conveyed in a natural grammar, but in 
a structured format that restricts the number of possible alternative 
meanings. Accordingly, while the grammar is clear in the sense of being 
easily understood by native speakers of the vocabulary and complex in its 
ability to express sophisticated concepts, sentences are derived from an 
organized vocabulary according to fixed rules. A query, preferably 
formulated in accordance with these rules, is employed by a search engine 
in the usual fashion. Due to the highly constrained meaning of the search 
query, and the likelihood that relevant documents have similar or 
matching abstracts in their headers, key-word searches are likely to 
identify the most relevant documents. 

French Abstract 

Afin de faciliter la recherche de documents, les documents 
electroniquement accessible sont pourvus de resumes rediges dans une 
grammaire artificielle extremement comprimee. La grammaire artificielle 
est capable d'exprimer la pensee et 1 ' information normalement exprimee 
par la grammaire naturelle, mais dans un format structure qui restreint 
le nombre de significations alternatives possibles. Par consequent, 
tandis que la grammaire est claire dans le sens qu'elle est facilement 
comprehensible par les locuteurs natifs du vocabulaire et complexe dans 
sa capacite d'exprimer des concepts sophistiques, les phrases proviennent 
d'un vocabulaire organise selon des regies fixes. Une requete, de 
preference formulee selon lesdites regies, est utilisee par un moteur de 
recherche de la maniere habituelle. En raison du sens extremement 
comprime de la requete, et la possibilite que des documents pertinents 
aient des resumes similaires ou analogues dans leur en-tete, la recherche 
de mots-cles est susceptible d' identifier les documents les plus 
pertinents . 

Legal Status (Type, Date, Text) 

Publication 20000727 Al With international search report. 

Examination 20001019 Request for preliminary examination prior to end of 

19th month from priority date 

Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Detailed Description 

Detailed Description 

. . . and the COMMUNICATOR and NAVIGATOR products 
supplied by Netscape Communications Corp. 

To support analysis module 225 (if included) , main memory 204 

may also include a partition defining a series of databases capable 

of 

storing the linguistic units of the invention; these are representatively 
de 

noted by reference numerals 2351, 2352, 2353i 2354... 
. . . col 

umns-the first containing the linguistic unit, the second containing a 
definition (if the linguistic unit has more than one meaning and is 
therefore replicated in the database ) , and the third containing a 



synonyms . 

An input buffer 240 receives from the user, via keyboard 210, an 
input sentence. Analysis module 225 examines the... 
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Detailed Description 

Claims 

Fulltext Word Count: 26601 
English Abstract 

An information storage system (100) includes a controller (116) for 
managing the resources of a common mass storage device (128) in order to 
enable multiple hosts {104, 104) connected to a common bus (106) to 
independently read and write to the mass storage device (128) in a 
relatively high speed manner on a first come, real time basis. In 
particular, a system of commands is provided which enables each host 
(104) to read and write to the mass storage device (128) on an 
independent, first come, real time basis by locking the requested address 
space irrespective of the origination. Even though an address storage 
space may be locked, the data within such space is always readable by 
another host (104). Should a subsequent host (104) issue a command to 
write to the locked address space, the command is aborted and a flag is 
set indicating to the subsequent requesting host (104) that the area is 
locked. 

French Abstract 

Un systeme de memorisation d * informations (100) comprend un controleur 
(116) pour gerer les ressources d'une memoire de masse commune (128) afin 
de permettre a un grand nombre d'hotes (104, 104) connectes a un bus 
commun (106) de lire et d'ecrire independamment dans la memoire de masse 
(128), avec une vitesse relativement elevee, en temps reel, et selon un 
systeme de premier arrive. En particulier, un systeme d ' instructions est 
prevu qui permet a chaque hote (104) de lire et d'ecrire dans la memoire 
de masse (128), selon un systeme en temps reel de premier arrive, en 
verrouillant 1* espace d'adressage demande quelle que soit son origine . 
Meme si un espace d'adressage peut etre verrouille, les donnees contenues 
dans cet espace peuvent toujours etre lues par un autre hote (104). 
Lorsqu'un autre hote (104) emet une instruction pour lire 1 ' espace 
d'adressage verrouille, 1 ' instruction est avortee et un drapeau est place 
pour indiquer a 1' autre hote (104} que la zone est verrouillee. 

Main International Patent Class: G06F-012/00 
International Patent Class: G06F-13:14 



Fulltext Availability: 
• Detailed Description 

Detailed Description 
. . . e.g. , 

audit space 146, data space 148, keys space 150, swap space 152 and pads 
2 0 space 154). After the pointers for the partitions 142 and 144 are 
determined , additional copies of the new database address table are 
stored in various 

protected memory storage areas, for example, one or both of the special 
purpose storage spaces 156 in step 532... 
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ABSTRACT EP 800135 Al 

There is disclosed a method and apparatus for controlling access to and 
corruption of information in a computer system. In known "PC Virus" 
protection methods the boot partition becomes "Read Only" when the system 
is in Supervised Mode. However, Microsoft Windows, although not strictly 
self-modifying, does require that certain files located within the 
Windows directory, can be written to. Accordingly the present invention 
provides a method of controlling access to and modification of 
information stored on a storage medium forming part of a computer system 
comprising: dividing information stored on the storage medium into a 
plurality of non-overlapping partitions including a boot partition and at 
least one general partition, characterised by: designating at least one 
of said partitions a Write Many Recoverable (WMR) partition wherein, in 
use, if a write command is issued to overwrite any resident information 
stored in a/the WMR partition by updating information is written on the 
storage medium in a location other than where the resident information is 
stored and a (virtual) pointer to the updated information is set up/kept 
so that the updated information can be accessed, as required during a 
remainder of a session, 

ABSTRACT WORD COUNT: 191 
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. CLAIMS a Sector Relocation Table (SRT) associated with it which table is 
held a Random Access Memory (RAM) of the Supervisor, each entry in a 
SRT defining the address of a range of sectors in the WMR 



partition that have been updated and an address where the updated 
information is located, this location being within the dedicated 
area . 

9. An apparatus for controlling... 
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ABSTRACT EP 780763 Al 

A software partitioning tool is disclosed. Based on a visual display of 
an application that shows program objects and the connections or 
interactions between the objects, an internal representation of the 
application is defined. As the user interacts with the visual display of 
the application, creating new partitions and relocating program objects 
in the new partitions, the internal representation of the display is 
constantly updated. Once a connection between program objects crosses 
partition boundaries, it is redefined in the internal representation as a 
distributed interaction (a connection), and representative server and 
client stubs are defined. At a number of points, the user also has the 
opportunity to set middleware protocols. Once the user commits to a 
distribution design, a code generator in the tool generates the actual 
server and client stubs for all distributed connections based on the 
definitions in the internal representation of the application. 
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. . .CLAIMS program object parts and connections between the objects; 
defining an internal representation of the displayed application; 
in response to user action, displaying at least one partition 

boundary and defining said at least one partition boundary in 

the internal representation; 
in response to user action, relocating, on the displayed application, at 

least one program object so that its connection with other program 



objects cross said at least one partition boundary and defining 
said connections as distributed connections in the internal 
representation; 

determining from said distributed connections server objects and client 

objects; and 
in response to a user. . . 
...comprising the computer-implemented steps of: 

initially defining an internal representation of the objects and 

connections of the displayed application design; 
in response to user definition of at least one partition boundary 

in the displayed application design, defining corresponding empty 

partition containers in the internal representation; 
in response to user relocation of at least one program object across 

said at least one partition... 

...visual representation of connections between the objects, the tool 
comprising : 

a metadata generator for defining a current internal representation of 
the displayed application and for defining any connections crossing 
partition boundaries in said displayed application as distributed 
connections in the current internal representation; and 

a code generator for generating distributed interfaces for all 
distributed connections defined... 

...internal representation of the objects and connections of the displayed 
application design; 

computer readable program code means for causing the computer, in 

response to user definition of at least one partition boundary 
in the displayed application design, to define corresponding empty 
partition containers in the internal representation; 

computer readable program code means for causing the computer, in. . . 
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ABSTRACT EP 378038 A2 

Any number of sorted lists are efficiently partitioned into P lists, 
where P represents the number of processors available to sort the 
resulting lists. When given a large list to sort, the list is initially 
divided into P lists, and each processor sorts one of these lists. The 
lists are then exactly partitioned so that each of the elements in the 
new consecutive partitioned lists have values no smaller than any of the 
elements in the lists before it, nor larger than any of the elements in 
the list following it. Partitioning is done by P-l processors. Each of 
the processors successively considers selected rows of elements from the 
sorted lists, and moves a partition boundary based on an element 
magnitude requirement and a partition size requirement. The new 
partitioned lists are then merged by the P processors, and simply strung 
together to provide a sorted list of all the elements. 

ABSTRACT WORD COUNT: 155 
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. . .CLAIMS consideration; 

b) fixing a partition boundary near the middle row of elements; 

c) determining the maximum value of all the elements under 
consideration above the partition boundary ; 

d) determining the elements under consideration below the 
partition boundary that are less than the maximum value; 

e) moving elements about the boundary based on the size... 

...of elements which should be moved to make the partitions the correct 
size; and 

moving the number of elements to be moved to make the partitions 
the correct size minus the number of elements determined in step 
d from below the boundary to above the boundary. 
9. A method of sorting a list of N elements using P processors, where 

...comprising the steps of: 

a) dividing the list into P sublists of approximately N/P elements; 

b) each processor sorting one of the sublists; 

c) defining P-l partition boundaries , each boundary being 
defined by one of P-l of the processors, said boundaries dividing 
the lists into nearly equal partitions of elements having values less 
than all of. . . 

...the presorted lists, each processor comprising: 

means for selectively and iteratively adding elements from the 
lists to a partitioning list; 

means for selecting an initial partition boundary for the 



partitioning list; 

means for determining a size modifier based on the desired 
number of elements above the partition boundary versus the actual 
number of elements above the partition boundary ; 

means for determining a magnitude modifier based on the number 
of elements below the partition boundary which are less than the 
magnitude of the largest element above the partition boundary. . . 

...the presorted lists, each processor comprising: 

means for selectively and iteratively adding elements from the 
lists to a partitioning list; 

means for selecting an initial partition boundary for the 
partitioning list; 

means for determining a size modifier; 

means for determining a magnitude modifier; and 

means for modifying the partition boundary as a function of the 
size modifier and magnitude modifier following each iterative 
addition of . , . 
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English Abstract 

A network-based storage system comprises one or more block-level storage 



servers (104) that connect to, and provide disk storage for, one or more 
host computers (102) over logical network connections (preferably TCP/IP 
sockets) 400. In one embodiment, each host (102) can maintain one or more 
socket connections (400) to each storage server (104), over which 
multiple I/O operations may be performed concurrently in a non-blocking 
manner. The physical storage of a storage server (104) may optionally be 
divided into multiple partitions, each of which may be independently 
assigned to a particular host (102) or to a group of hosts. Host driver 
software (204) presents these partitions to user-level processes as one 
or more local disk drives. When a host (102) initially connects to a 
storage server (104) in one embodiment, the storage server (104) 
initially authenticates the host, and then notifies the host (102) of the 
ports that may be used to establish data connections (400) and of the 
partitions assigned to that host (102) . 

French Abstract 

L' invention concerne un systeme de stockage base reseau, comprenant au 
moins un serveur de stockage de niveau bloc (104), lequel se connecte a 
un ou plusieurs ordinateurs hotes (102), sur des connexions reseau 
logiques (400) (de preference des prises TCP/IP), et permet le stockage 
disque pour ce ou ces ordinateurs. Dans un mode de realisation, chaque 
hote (102) peut conserver une ou plusieurs connexions de prise (400) avec 
chaque serveur de stockage (104), connexions par 1 ' intermediaire 
desquelles il est possible d'executer plusieurs operations d'E/S, de 
maniere concurrente, sans blocage. Le stockage physique d'un serveur de 
stockage (104) peut se decouper eventuellement en plusieurs partitions, 
chacune pouvant etre assignee de maniere independante a un hote en 
particulier (102) ou a un groupe d' hotes. Le logiciel de pilotage hote 
(204) presente ces partitions aux procedes niveau utilisateur, sous forme 
d ' une ou plusieurs unites de disques locaux. Dans un mode de realisation, 
lorsqu'un hote (102) se connecte d'abord a un serveur de stockage (104), 
le serveur de stockage (104) authentifie d'abord l'hote (102), puis 
indique a ce dernier les ports qui peuvent etre utilises aux fins 
d' etablissement de connexions de donnees (400), ainsi que les partitions 
assignees a cet hote (102). 
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Fulltext Availability: 
Claims 



Claim 

provides functionality for allocating a partition to multiple host 
computers to permit sharing of partitions. 

42 The storage server system of Claim 38, wherein the partitions have a 
user- definable size . 

43 The storage server system of Claim 38, wherein the software system 
supports the ability for a host computer to concurrently perform multiple 
inputioperations over. . . 
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Detailed Description 

Claims 

Fulltext Word Count: 19072 
English Abstract 

Methods and systems (1000) are provided for merging computer disk 
partitions to reduce the number of partitions (1010). Unlike conventional 
approaches that rely on FDISK, the invention does not destroy user data 
on the disk (1008) during or after the two or more partitions are merged. 
Two or more adjoining partitions may be combined. During a merging 
operation, partitions may have their clusters aligned (612) or resized 
(614) . The merging partitions may also have their partition type changed 
(606) . During the merge at least one copy of all system and user data of 
all partitions is kept on a disk at all times, reducing the risk of data 
loss . 

French Abstract 

L' invention concerne des procedes et des systemes (1000) destines a 
fusionner des partitions de disque afin de reduire le nombre de 
partitions (1010) . Contrairement aux approches tradit ionnelles qui 
reposent sur 1 f utilisation de FDISK, la presente invention ne detruit pas 
les donnees utilisateurs du disque (1008) durant ou apres la fusion des 
deux partitions ou plus. Deux partitions adjacentes ou plus peuvent etre 
associees. Au cours de 1' operation de fusion, les blocs des partitions 
peuvent etre alignes (612) ou redimensionnes (614). Le type de partition 
des partitions qui fusionnent est egalement modifie (606). Au cours de la 
fusion, au moins une copie de toutes les donnees du systeme et des 
donnees utilisateur de toutes les partitions sont conservees en 
permanence sur un disque, ce qui reduit ainsi le risque de perte de 
donnees . 

Legal Status (Type, Date, Text) 

Publication 20010322 Al With international search report. 

Examination 20010816 Request for preliminary examination prior to end of 

19th month from priority date 

Fulltext Availability: 
Claims 

Claim 

The method of claim 1, further comprising completing the merger of 
each secondary partition into the target partition to produce the merged 
partition, the merged partition having the determined cluster size 
and the determined partition type, the method being performed without 
destroying user data of the target partition except at user request and 
without destroying user data of any secondary. . .progress markers 
corresponding- to 

incrementally increasing portions of the merged partition production. 



32 The system of claim 27, wherein the system further comprises a cluster 



size checker which determines whether a partition needs to have its 
clusters resized, and the system further comprises a cluster resizer 
which resizes those clusters of the partition which the cluster size 
determiner has determined need to be resized. 

33 The system of claim 27, wherein the system preserves at least one copy 
of all system data of all merging... 
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Claims 

Fulltext Word Count: 21408 
English Abstract 

A method is described for allowing a user to select one of a plurality of 
items. The user employs a device having a display area, and a joystick or 
a contact sensitive area. The device displays a number of regions equal 
to the number of items, and defines a number of sections in the angular 
range of the joystick, or sections within the contact sensitive area, 
equal to the number of items, and arranged corresponding to the 
arrangement of the regions of the display area. The user selects one of 
said items by selecting the corresponding section. 

French Abstract 

L f invention a trait a un procede permettant a un utilisateur de 
selectionner un article parmi plusieurs . L ' utilisateur utilise un 
dispositif dote d ' une zone d'affichage et un module d ' instruct ion ou une 
zone sensible de contact. Le dispositif affiche un certain nombre de 
regions egales au nombre d' articles, et definit un certain nombre de 
sections dans la portee angulaire dudit module, ou de sections a 
l'interieur de la zone sensible de contact egales au nombre d' articles et 
disposees de maniere a correspondre a la disposition des regions de la 
zone d'affichage. L 1 utilisateur selectionne un desdits articles en 
choisissant la section correspondante . 
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Fulltext Availability: 

Claims 
Claim 

one of said 
sections . 

4 A method according to claim 3 in which the sections 
collectively cover the whole of the angular range, so 

that defining the sections is equivalent to partitioning 
the angular range . 

5 A method according to any preceding claim in which 
the user can (i) vary the selection of the item, 
information being displayed in relation. . . 
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COMPUTER SYSTEM AND METHOD FOR OPERATING MULTIPLE OPERATING SYSTEMS IN 
DIFFERENT PARTITIONS OF THE COMPUTER SYSTEM AND FOR ALLOWING THE 
DIFFERENT PARTITIONS TO COMMUNICATE WITH ONE ANOTHER THROUGH SHARED 
MEMORY 

SYSTEME ET PROCEDE INFORMATIQUES DE COMMANDE DE SYSTEMES D ' EXPLOITATION 
MULTIPLES DANS D I FFE RENTE S PARTITIONS DU SYSTEME INFORMAT I QUE ET 
PERMETTANT AUX DIFFE RENTES PARTITIONS DE COMMUNIQUER ENTRE ELLES PAR 
UNE MEMO I RE P ART AGE E 
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Claims 

Fulltext Word Count: 4 584 3 
English Abstract 

A computer system comprises a plurality of processing modules that can 
be configured into different partitions within the computer system, and a 
main memory. Each partition operates under the control of a separate 
operating system. At least one shared memory window is defined within the 
main memory to which multiple partitions have shared access, and each 



partition may also be assigned and exclusive memory window. Program code 
executing on different partitions enables those partitions to communicate 
with each other through the shared memory window. Means are also provided 
for mapping the physical address space of the processors in each 
partition to the respective exclusive memory windows assigned to each 
partition, so that the exclusive memory windows assigned to each 
partition appear to the respective operating systems executing on those 
partitions as if they all start at the same base address. 

French Abstract 

L ' invention concerne un systeme informatique qui comprend une pluralite 
de modules de traitement que 1 ' on peut configurer en differentes 
partitions dans le systeme informatique, et une memoire principale. 
Chaque partition fonctionne sous la commande d'un systeme d ' exploitation 
separe. Au moins une fenetre de memoire partagee est definie dans la 
memoire principale a laquelle plusieurs partitions ont un acces partage, 
et chaque partition peut aussi se faire attribuer une fenetre de memoire 
exclusive. L' execution d'un code programme dans differentes partitions 
permet a ces partitions de communiquer entre elles par la fenetre de 
memoire partagee. Cette invention concerne aussi des moyens permettant de 
projeter l'espace d'adresses physiques des processeurs dans chaque 
partition dans les fenetres de memoire exclusives respectives attribuees 
a chaque partition, de facon que les fenetres de memoire exclusives 
attribuees a chaque partition semblent toutes partir de la meme adresse 
de base pour les systemes d ' exploitation respectifs qui s'executent dans 
ces partitions. 

Fulltext Availability: 
Claims 

Claim 
. . . DATA 
i 1314 

RELEASE SUB-PODs FROM RESET 
(IDENTIFY BIOS SUB-PODS (BSPs)) 
1316 

INITIALIZE PCI 

BUSSES 

i 1318 

READ CONFIGURATION DATA OPTIONAL 

TO IDENTIFY PARTITIONS 

1320 

CALCULATE SIZE OF 

HIGH AND LOW 
MEMORY HOLES 
1322 

INFORM MANAGEMENT INTERFACE 
PROCESSOR (MIP) OF THE AMOUNT OF 
MEMORY-MAPPED 1/0 SPACE REQUIRED 
BY PCI CARDS. . . 
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Detailed Description 

Claims 

Fulltext Word Count: 7905 
English Abstract 

The invention is a storage device (1) for a host computer system. The 
device (1) incorporates a Supervisor function for controlling access to 
information stored in a storage medium (2) of the device. The main 
embodiment described is a hard disk drive (1) comprising: one or more 
disk platters (2) for storing information; a ROM (4) for storing firmware 
for controlling operation of the drive; a volatile RAM (5); a 
micro-controller (7) for controlling the transfer of information to and 
from the disk platter (s) (2); and an interface (6) for interfacing the 
drive (1) with the host computer system and via which information is 
transferred to and from the disk platter (s) (2) under the control of the 
micro-controller (7). A Supervisor is provided in the form of firmware 
which is preferably stored in the ROM (4), the Supervisor operating the 
micro-controller (7) so as to protect information stored on the disk 
platter ( s ) . 

French Abstract 

La presente invention concerne un dispositif de stockage (1) destine a 
un systeme informatique hote. Le dispositif de 1' invention (1) comprend 
une fonction de superviseur qui commande l'acces aux informations 
stockees sur un support de stockage {2) du dispositif. Dans le mode de 
realisation principal, le dispositif est compose d ' une unite de disque 
dur (1) comprenant: un ou plusieurs supports disques (2) destines a 
stocker les informations; une ROM (4) destinee a stocker les logiciels 
microprogramrnes qui commandent le f onctionnement de 1* unite; une RAM 
volatile (5); un microcont roleur (7) qui commande le transfert des 
informations depuis et vers le{s) support (s) disque (s) (2); et une 
interface (6) qui assure 1 ' inter facage entre 1'unite de disque (1) et le 
systeme informatique hote et via laquelle les informations sont 
transferees vers et depuis le(s) support (s) disque (s) (2) sous le 
controle du microcontroleur (7). Un programme superviseur, se presentant 
sous la forme d'un logiciel microprogramme stocke dans la ROM (4), assure 
le f onctionnement du microcontroleur (7) de facon a proteger les 
informations stockees sur le(s) support(s) disque(s) (2). 

Fulltext Availability: 
Claims 

Claim 

... is held in said volatile RAM means 

35 (5) of the storage device (1), and each entry in a said SRT is 
a pointer which defines the address of a range of sectors in 
the WMR partition that have been updated and an address where 
the updated information is located, this location being within 
a dedicated area on the storage medium (2... 
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METHOD AND APPARATUS FOR DYNAMIC QUEUE SIZING 

PROCEDE ET APPAREIL SERVANT A DIMENSIONNER DE FACON DYNAMIQUE DES FILES 
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Detailed Description 

Claims 

Fulltext Word Count: 4 903 
English Abstract 

A system and method for dynamically resizing queues (1023) used in a 
network switch (210) to accommodate potential congestion situations 
without experiencing data loss. In one embodiment, partiton pointer 
registers (410) are used to indicate when resizing is desirable. The 
control logic (405) then determines when it is safe to update the size of 
the queue such that no data loss occurs and timely updates the queue 
size . 

French Abstract 

L' invention concerne un systeme et un procede servant a redimensionner 
de facon dynamique des files d'attente (1023) dans un centre de 
commutation (210) de reseau, afin de faire face a des situations 
potentielles de congestion sans pertes de donnees . Dans un mode de 
realisation, des registres (410) de pointeurs de partition sont utilises 
pour indiquer quand un redimensionnement est souhaitable. La logique 
(405) de commande determine ensuite le moment le plus sur pour mettre a 
jour la dimension de la file d'attente afin qu'il n'y ait aucune perte de 
donnees, et effectue la mise a jour de la file d'attente au bon moment. 

Fulltext Availability: 

Claims 
Claim 

one queue; 

for each partition boundary to be updated; 

checking states of the at least one queue affected by movement 
of the location of the partition boundary ; 

determining when it is safe to move each location of the 

partition boundary ; and 
when it is determined that it is safe to move the location of the 

1 I partition boundary, updating the partition boundary to the updated 
location . 

2 The method. . . 

...not located in the area affected by movement of the partition boundary. 
5 An apparatus comprising: 

a memory comprising at least one queue, each queue defined by 

partition boundaries ; 
at least one partition pointer, each partition pointer identifying a 
location of a partition boundary; and 

control logic coupled to the memory and the at least one partition 
pointer... to indicate an updated location of the partition 
boundary; 

checking states of at least one queue affected by movement of the 
10 location of the partition boundary ; 

I I determining when it is safe to move the location of the partition 

12 boundary ; and 

19 

when it is determined that it is safe to move the location of the 
partition boundary, updating the partition boundary to the updated 
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JP 2003167881 A 20030613 JP 2001370294 A 20011204 200347 B 

Priority Applications (No Type Date) : JP 2001370294 A 20011204 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 2003167881 A 6 G06F-017/30 

Abstract (Basic) : JP 2003167881 A 

NOVELTY - A display unit displays an electronic map of a designated 
area, which is divided into several partitions. The databases (2-6) 
store statistical data including population, number of public 
institutions, etc., corresponding to each partitioned area of the 
map. A determination unit determines a proposed site for erecting a 
shop based on the statistical data corresponding to each partitioned 
area on the map. 

USE - Used in business applications for collecting statistical 
information such as number of public institutions such as schools, 
number of residents and population composition in designated area, for 
determining proposed site for erecting shops. 

ADVANTAGE - Automatically performs area analysis to determine a 
suitable erection site efficiently. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the statistical data collection system. (Drawing includes non-English 
language text) . 

information process (1) 
statistical information databases (2-6) 
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Database administration and replication method involves storing 
statistics for each of database sampled records to perform 
extrapolated replication partition analysis operation on database 

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC ) 

Inventor: HARPER J W; SLISHMAN G R 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 
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Abstract (Basic): US 20030004973 Al 

NOVELTY - The database records are randomly sampled using a 
random sampling facility (26) which is integrated within a database 
management system (14). The statistics for each of the sampled 
records are stored, based on which an extrapolated replication 
partition analysis operation on the database , is performed. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for 
database management system. 

USE - For administration and replication of database storing 
business setting information, individual and corporate accounts, etc. 

ADVANTAGE - Approximation partition analysis is performed 
without straining or otherwise compromising computer system resources. 
The integrated sampling facility reduces number of system calls 
required for performing the analysis and also enables rapid access to 
records being retrieved. 

DESCRIPTION OF DRAWING (S) - The figure shows a block diagram of 
computer system. 

database management system (14) 

random sampling facility (26) 
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Database partition boundary determination method in information 

system, involves sampling records of database using random number 

algorithms, which are added or deleted from database 
Patent Assignee: INT BUSINESS MACHINES CORP (IBMC ) 
Inventor: HARPER J W; SLISHMAN G R 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030004944 Al 20030102 US 2001897853 A 20010702 200333 B 

Priority Applications (No Type Date) : US 2001897853 A 20010702 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030004944 Al 10 G06F-007/00 

Abstract (Basic) : US 20030004944 Al 

NOVELTY - A particular number defining a desired sample size is 
selectively received to provide a seed value for initializing a random 
number algorithm. The records of a database (10) which are randomly 
sampled using the algorithm, are added or deleted from the database . 

Statistics for each record including a record key is stored to 
produce an approximation partition analysis . 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for 
database partition boundary determination system. 

USE - For databases in information system for business 
application. 

ADVANTAGE - Enables obtaining accurate analysis for dynamically 
changing databases even though approximation partition analysis 



is not mathematically exact. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the computer system, 
database (10) 
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Characterizing S by the identification of textual and physical structured 
query fragments/ useful for the analysis of textual and biopolymer 
information 

Patent Assignee: US DEPT HEALTH & HUMAN SERVICES (USSH ) 
Inventor: BOISSY R J 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20020177138 Al 20021128 US 2000248541 P 20001115 200331 B 
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Priority Applications (No Type Date) : US 2000248541 P 20001115; US 

2001991013 A 20011114 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

US 20020177138 Al 118 C12Q-001/68 Provisional application US 2000248541 

Abstract (Basic) : US 20020177138 Al 

NOVELTY - Characterizing (Ml) a set of strings (S) comprising 
receiving S with process-pattern containing substrings, defining series 
of search target S patterns effective for searching S and processing 
through an ordered series of search steps, each step being specific for 
one search class and an attempted discovery of an appropriate search 
target site to define a delimited search region for the next step, 
thereby characterizing S, is new. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for: 

(1) analyzing (M2) a set of polynucleotides {PN's), comprising 
identifying electronic structured query fragments using M and isolating 
physical structured query fragments, where the isolating comprises 
providing the set of PN's and isolating physical structured query 
fragments within the set of PN's by isolating fragments that remain 
after processing the set of PN's through a series of step-wise 
delimitation processes comprising cleaving the set of PN's with a 
cleavage effector to form a set of PN fragments including target PN 
fragments, and retaining only the target PN fragments for a next 
preemptive cleavage according to each recognition site pattern of the 
series of recognition site patterns, and comparing the electronic 
structured query fragments to the physical structured query fragments, 
thereby analyzing the set of PN's; 

(2) isolating and characterizing (M3) a set of PN's using M2; 

(3) characterizing (M4) using Ml and defining a process for 
identifying the process-pattern containing substrings based on a 
selected arrangement of search targets within a search target string 
pattern, and performing the process to identify the process-pattern 
containing substrings within the S for each search target pattern in 
the series of search target patterns, thereby characterizing the S; 

(4) characterizing (M5) sets of strings, comprising receiving one 
or more sets of strings of any length, where may be found occurrences 



of relatively short search-target-strings of interest, and where one or 
more of the short search- target-strings are used to define a distinct 
search target, and where several distinct search targets or targets are 
assembled into structured entities known as search target groups, where 
a search target group is comprised of a partition search target that is 
used to partition the sets of strings under study into substrings or 
partition fragments bounded by consecutive occurrences of the partition 
search target, and a small array of a limited number M of major classes 
or ordered sets of search targets, where each major class is comprised 
of a limited number of ranked member search targets, and where a search 
target group of target group, of two or more search target groups or 
target groups of distinct composition of structure, may be used to 
characterize search target group-defined substrings found within the 
sets of strings under study, using the structure and composition of a 
search target group with M major classes to define a search process 
comprised of a series of M search steps that are to be effected within 
each of the partition fragments obtained, from the sets of strings 
under study, using the partition search target of the target group, and 
where the search process defines patterns, of occurrence within the 
partition fragments of search targets that are members of the target 
group, and where partition fragments or regions therein may be 
characterized by the occurrence of instances, of the process patterns 
that may be defined by the structure and composition of the target 
group, and using the structure and composition of a search target group 
with M major classes to effect a search process comprised of a series 
of M search steps within each of the partition fragments obtained, from 
the sets of strings under study, using the partition search target of 
the target group, and where the search process results in the detection 
of process-pattern entities, where each process-pattern entity is 
comprised of a pattern of M search target sites, which together include 
a search target site representing one member of each of the M major 
classes in the target group, and where each of the sites must be 
present and where sites representing higher-ranked members of the same 
major class must be absent within the relevant search area for the 
major class in the partition fragment, and where the process pattern 
entities are obtained as a result of a stepwise search and delimitation 
process after each site is found that restricts the region of the 
partition fragment where the next class-specific target-search occurs, 
and where partition fragments or regions therein may be characterized 
by the occurrence therein of process-pattern entities, where the 
process-pattern entities represent instances of the process-patterns 
that may be defined by the structure and composition of the target 
group, and where partition fragments or regions therein may be 
characterized by the occurrence therein of structured query fragments 
(SQFS) that are fragments bounded any two search target sites in a 
process- pattern entity, and whose lengths can be calculated by the 
positions of the constituent sites that comprise the process-pattern 
entity wherein the SQFs are found, and where the SQFs of particular 
interest are typically the SQFs bounded by the last two search target 
sites detected in the identification of a process-pattern entity; and 

(5) physical characterization (M6) of a sample of PN's of the 
same general type. 

USE - Ml to M6 are useful for identifying, classifying, comparing, 
generating and/or separating fragments derived from one or more 
physical samples of PN's. They can also be used in computational and 
laboratory methods and databases for analyzing textual and biological 
sequence information. 
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Management device for partition table of database has addition section 
which replaces existing partition of partition table of database based 
on extracted partition definition information 

Patent Assignee: NEC CORP (NIDE ) 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 
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Abstract (Basic) : JP 2002041333 A 

NOVELTY - A definition information extraction section obtains the 
partition definition information used in a new partition to be 
replaced and added to the existing partition of the partition table of 
a database . An addition section replaces the existing partition of 
the partition table of the database with the new partition based on 
the extracted partition definition information. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for a 
database partition table management method. 

USE - For partition table of database 

ADVANTAGE - Enables automatic replacement and addition of new 
partition to existing partition of partition table of database . 
Maintains improvement of a disc in input-output performance by 
maintaining distribution condition of partition, thereby preventing 
data overflow of a predetermined area. 

DESCRIPTION OF DRAWING (S) - The figure shows the sample of script 
production of a partition addition. (Drawing includes non-English 
language text) . 
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Restoration of database in a computer, involves applying modifications in 
log file to copied objects, including table index and partition index, 
during one pass through log file 

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC ) 
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Abstract (Basic) : US 6119128 A 

NOVELTY - The method involves copying objects, including the table 
index and partitioning index, from the secondary data storage device to 
the database on the primary data storage device after receiving a 
recovery indicator. Modifications in the log file are applied to the 
copied objects, including the table index and partitioning index, 
during one pass through the log file. 

DETAILED DESCRIPTION - The method begins by copying different 
objects in the database from the primary data storage device to the 
secondary storage device, in which the table index is used to locate 
data in a table while the partitioning index defines the scope of 
each partition and assigns a row of the table to respective 
partition. Modifications to the objects are logged in the log file. The 
recovery indicator shows the required recovery of objects in the 
database. INDEPENDENT CLAIMS are also included for the following: 

(a) the restoration apparatus used on the database of a computer; 

(b) and the manufacture of the computer program carrier used in 
database restoration. 

USE - Used in computer-implemented database systems and in 
recovering different types of objects with one pass of the log. 

ADVANTAGE - Provides recovery for partitions, partitioning indexes 
and table indexes simultaneously. Requires only one pass of log file to 
apply modifications to database. 

DESCRIPTION OF DRAWING (S) - The figure shows the recovery system 
for database in computer. 
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Provision method for continuous database service and scalable query 
performance uses active redundant copies , redundancy group and several 
computing system partitions , each group manages database schema 
replication in partitions 

Patent Assignee: NCR INT INC (NATC }; NCR CORP 

Inventor: ANTOUN S Z; BLEVINS T J; DEMPSTER P B; 
ROBINSON I M; STELLWAGEN R G 
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Patent Details: 
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Abstract (Basic) : EP 992909 A2 

NOVELTY - The method has several computing systems connected 



together via a network each comprising one or more computing system 
partitions . A redundancy group has a computing system and several 
computing system partitions , with each redundancy group managing the 

replication of the database schema within the computing system 
and computing system partitions within the redundancy group. 

USE - For the provision of continuous database service and scalable 
query performance using active redundant copies. 

ADVANTAGE - Provides a system with reasonable development costs and 
implementation schedules that does not sacrifice the benefits of open 
systems . 

DESCRIPTION OF DRAWING (S) - The drawing shows a block diagram of 
the hardware environment that could be used, 
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Replicated object management method for hierarchical network database - 
involves determining object IDs of target and its parent objects, and 
combining object IDs to form database-wide object ID 
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Abstract (Basic) : US 5832487 A 

The method involves obtaining a replica ID (102) which identifies 
the replica relative to other replica in database . The replica 
ID and an integer value (104) are used to form a partition-wide object 
ID for target object. 

The integer value is calculated by event counter value (108), 
pseudo-random value (110), time stamp value (112), GUID value (114). 
The partition -wide object ID is determined for each parent object 
of target object. The parent and child object IDs are combined to form 
database-wide object ID. 

USE - In distributed digital network. 

ADVANTAGE - Unique identifier of database is not updated during 
updatipn of object name. Does not allow distinct object to have same 
IDs. 
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Computer system for object identity and partitioning for user defined 
extents , has computer program with schema mapper for mapping between 

object attributes and fields in database table 
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Abstract (Basic) : US 6341289 Bl 

NOVELTY - The computer system (100) has a computer program stored 
in a memory (120) and executed by a processor (110). The computer 
program includes a schema mapper for mapping between object attributes 
and fields in a database table. The schema mapper defines the source 
of a partitioning key value and the partitioning key field in the 
database table for storing the partitioning key value. 

DETAILED DESCRIPTION - The partitioning key value identifies the 
partition containing the object within a class of objects. The 
partition also defines the subclass of objects with the class. 
INDEPENDENT CLAIMS are also included for the following: 

(a) the computer program; 

(b) and the mapping method between objects and database table 
used to persistently store objects. 

USE - For object identity and partitioning for user defined 
extents . 

ADVANTAGE - Allows transparent and flexible partitioning of created 
objects. Allows queries to be performed against partition without 
requiring user to have any specific knowledge of the partitioning 
structure. Provides customization and extension quality of framework 
mechanisms that are valuable to framework consumers because the cost of 
customizing or extending a framework is much less than the cost of 
replacing or reworking an existing solution. Allows maximum flexibility 
in application development and deployment. 

DESCRIPTION OF DRAWING (S) - The figure shows the schematic view of 
the computer system. 

Computer system (100) 

Processor (110) 

Memory (120) 
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'Repartitioning of data stored in direct access storage device connected 
to computer, involves reorganizing identified partitions based on altered 
partitioning scheme by moving data between identified partitions 
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Abstract (Basic): US 6125370 A 

NOVELTY - The adjacent partitions of data that would be affected by 
the altered partitioning scheme, are identified. Access to each 
identified partition is restricted, without restricting access to other 
partitions. The identified partitions are reorganized based on the 
altered partitioning scheme, by moving data between the identified 
partitions, while allowing access to other partition. 

DETAILED DESCRIPTION - The change in partitioning scheme for data 
is detected during alteration of partitioning index specifying one or 
more partitions and a limit key for each partition . The limit key 
defines a range of values for the partition . The adjacent 
partitions affected by the altered partitioning scheme are identified 
and reorganized. An INDEPENDENT CLAIM is also included for data 
repartitioning apparatus. 

USE - For data repartitioning in relational databases stored in 
direct access storage devices such as hard disk drive, tape drive, 
floppy disk drive connected to computer. 

ADVANTAGE - Because the rebalancing of data is limited to the 
affected partition, the repartitioning system provides a technique for 
rebalancing a subset of partitions without restricting access to 
unaffected partitions. Enables shifting of data among partitions based 
on the changed partitioning scheme, reliably, 

DESCRIPTION OF DRAWING (S) - The figure shows the flow diagram 
illustrating the process sequence involved in data repartitioning 
method . 
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Information handling system for multiprocessor database - has workfile 
disks which are logically partitioned into multiple groups and shared by 
logical processors which separately execute mergesort operation 
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Abstract (Basic) : US 5671405 A 

The system includes several logical processors each of which 
operates under control of an image of an operating system program. A 
database storage system stores data in a predetermined data structure 
and one or more workfile storage disk store workfiles during mergesort 
operations. One or more workfile storage disks are shared by one or 
more logical processors. Mergesort operations are executed on separate 
logical processors. 

The execution involves sorting the data structure into one or more 
ordered runs and determining a logical partition size of the 
workfile storage disks. A least loaded partition is selected and one or 
more ordered runs are written into the workfile storage disks in the 
selected partition. The ordered runs are merged into a single sorted 
run . 

ADVANTAGE - Processes efficient concurrent mergesorts. Allows 
dynamically choose less loaded partition to achieve benefits of load 
balancing . 
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Still-picture registration processor for still-picture database 
applications - in which image registering part registers information on 
intersection of each block based on representation colour 
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JP 9016614 A 9 G06F-017/30 

Abstract (Basic) : JP 9016614 A 

The still-picture registration processor divides the still- picture 
data (2) into several blocks separated by boundary lines, using a 
block partitioning part (3). A border-line calculating part (4) 
computes a outline-representing point, which is obtained as the 
intersection of the boundary line and border line of an image, for 
every boundary-line of each block. When the outline-representing point 
is obtained at two or more places, a border line simplifying part (5) 
selects the representation indication of two points from the 
outline-representing part. 

A colour calculating part (6) divides each block into two areas 
using linear approximation which connects the intersections and 
computes the allowed colour specification for each area. An image 
registering part (7) registers the information (8) on the intersection 
of each block and on each representation colour. 

ADVANTAGE - Simplifies display of still-picture data. Enables easy 
distinction of data. Improves image characteristics. Enables easy 
digital signal processing and provision of hardware. Improves 



operativity. Enables high speed processing. 
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Buffering packets in digital communications system in order to fairly 
distribute unused buffer space between connections and traffic flow 
groups 
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Abstract (Basic) : CA 2273291 Al 

NOVELTY - A hierarchy of memory partitions is defined , where each 
partitions consists of child partitions . The size of the top 
level partitions is set, whilst the nominal partition size for 
the child partitions is dynamically computed based on the 
congestion of each given child memory partition. The final step is 
iterated until all the partition sizes have been set. 

USE - For digital communications system e.g. ATM network. 

ADVANTAGE - Fairly distributes unused buffer space between 
connections and traffic flow groups. 

DESCRIPTION OF DRAWING ( S ) - The drawing shows a schematic diagram 
of the memory hierarchy in the buffer. 
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Optimized runtime communication processing in co-/multisimulation 
environment, involves limiting synchronizations between solvers to 
situation in which simulation is performed based on provided event 
information 
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Abstract (Basic) : US 6108494 A 



NOVELTY - The optimized direction information of boundary nets 
(425) that satisfy partitioning rules, is determined . The solvers 
are synchronized based on determined optimized direction information, 
in response to event information provided by first solver. The 
synchronizations between first and other solvers are limited to 
situations in which simulation is performed by each solvers depending 
on event information. 

DETAILED DESCRIPTION - The design source (415) is read upon which 
first simulator operates. The desired source defines several cells 
representing a design of the system or a portion of it. The two or more 
instances of a cell is identified, where respective subset of instances 
containing one instance but not all of the cell's instances is assigned 
to a predetermined solvers based upon the set of partitioning rules. 
The edited design source corresponding to a partition to be stimulated 
by the first solver and netlist information in format understandable by 
second solver are generated. The edited design source includes modified 
cell description of a parent cell with which the cell is associated. 
The directions associated with boundary nets are accumulated based on 
corresponding netlist information. Each boundary net has a direction 
associated with each design partition it connects. An INDEPENDENT CLAIM 
is also included for optimized runtime communication processing 
program. 

USE - For use in co-/mult isimulation environment and electronic 
design automation. 

ADVANTAGE - Since the synchronizations between solvers are limited 
to situations in which simulation is performed depending on event 
information from another solver, the runtime is optimized by avoiding 
unnecessary synchronizations during the simulation session. Increases 
runtime performance of co-/mult isimulation environment by reducing 
number of connections and traffic between simulators. 

DESCRIPTION OF DRAWING (S) - The figure shows the overview of N-way 
co-/simulation process. 

Design source (415) 

Boundary net (425) 
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Document partitioning fractionation method in digital computer for 
nonhierarchical , linear-time partitioning of corpus of documents by 
determining partitioning of desired size from ordering 
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Abstract (Basic): EP 980043 A2 

NOVELTY - The method involves preparing an ordering of a corpus by 
determining a partitioning of a desired size from the ordering, 
and the partitioning is further refined. 

DETAILED DESCRIPTION - Rather than attempting to express this 
information need as a formal query, the user instead selects a number 
of the top-level clusters (22A-I) that, from their description, seem 
relevant to the topic of interest. In this case, the user selects the 
clusters ( 22A, 22C, 22H ) labeled 'military history', 'science and 
industry', and 'American society' to form a reduced corpus (24) of the 
indicated subset of articles from Grolier's. In the example the cluster 
labels are idealized. 

USE - In a document-clustering-based browsing procedure for a 
corpus of documents, which is applicable over all natural languages 
that contain a lexical analysis capability. 

ADVANTAGE - Transforms the geometric structure into the logical 
structure, which represents the semantics carried by the documents. A 
virtual field separator technique is employed to utilize information 
carried by a special constituent of documents such as field separators 
and frames, keeping the number of transformation rules small. 

DESCRIPTION OF DRAWING (S) - The drawing is an illustrative diagram 
of preferred embodiment of the Scatter-Gather document browsing method 
according to an embodiment of the present invention. 

reduced corpus (24) 

top-level clusters (22A-I) 
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Abstract (Basic) : US 5974567 A 

NOVELTY - An user partition is created. A ghost partition 
overlapping the user partition is created. Diagnostic software is 
transferred to ghost partition. The ghost partition is disabled after 
determining that ghost partition is not required. 



DETAILED DESCRIPTION - Ghost partition is adapted to contain 
diagnostic software and download verification software. Maximum and 
minimum partition size is determined for the ghost partition 
and ghost partition size is allocated appropriately. An INDEPENDENT 
CLAIM is also included for partition creation and deletion program 
storage device. 

USE - In data storage device with diagnostic system e.g. 
multiplatter disk drive. 

ADVANTAGE - Avoids need to set up large system partition and avoids 
wasting disk space associated with manufacturing diagnostic, as no 
master boot records for any of user or system partition is modified 
during processing. 

DESCRIPTION OF DRAWING (S ) - The figure shows the flow chart 
illustrating partition creation and deletion method. 
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Automatic size calculation method for partition members in rooms - 
involves computing size of partition member by selecting one among 
basic specification stored in microcomputer, corresponding to measured 
size of room space 
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Abstract (Basic) : JP 9203225 A 

The method uses a laser type distance measuring equipment (1) which 
is connected to a microcomputer (2). The size of room space is measured 
using a laser beam irradiated from the distance measuring equipment. A 
number of partition member basic specifications are stored in the 
microcomputer . 

The size of a partition member is computed by selecting one 
among the number of stored basic specifications corresponding to 
measured size of room space. 

ADVANTAGE - Enables to deduce size of each partition member 
automatically based on size of room. Shortens measurement time. 
Avoids generation of mistake in distance measurement work. 

Dwg. 1/3 

Title Terms: AUTOMATIC; SIZE; CALCULATE; METHOD; PARTITION; MEMBER; ROOM; 

COMPUTATION; SIZE; PARTITION; MEMBER; SELECT; ONE; BASIC; SPECIFICATION; 

STORAGE; MICROCOMPUTER; CORRESPOND; MEASURE; SIZE; ROOM; SPACE 
Derwent Class: Q43; Q4 6; S02; T01 
International Patent Class (Main) : E04H-001/00 

International Patent Class (Additional): E04B-002/74; G01B-011/00; 

G01C-005/00; G06F-015/02 
File Segment: EPI; EngPI 



17/5/16 (Item 14 from file: 350) 



. • DIALOG (R) File 350:Derwent WPIX 

(c) 2003 Thomson Derwent . All rts. reserv. 

011344485 **Image available** 

WPI Acc No: 1997-322390/199730 

XRPX Acc No: N97-266753 
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Abstract (Basic): EP 780763 A 

The object oriented application creation method involves displaying 
application layout illustrating object parts and links. The application 
is defined internally. At least one partition boundary is 
displayed and represented internally in response to user action. 

At least one program object is relocated so that it's connection 
with other objects cross at least one partition boundary and 
defining the connections as distributed in the internal connection. 
Client and server objects are determined from the distributed 
connections. In response to a user commit action server code structure 
is generated with a distributed interface for each server. A client 
stub is generated with the distributed interface for each client part 
corresponding to each server. 

ADVANTAGE - Allows effective utilisation of network resources. 
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